
MODERN 

QUANTUM 

MECHANICS 

SECOND EDITION



J. J. Sakurai • Jim Napolitano







A Classic Made Current 



This classic text provides a graduate-level, non-historical, modern
introduction to quantum mechanical concepts. The author, J. J. Sakurai,
was a renowned particle theorist. This revision by Jim Napolitano retains
original material and adds topics that extend the text’s relevance for
the 21st century. The introduction of new material and modification
of existing material appear in a way that better prepares you for the next course in
quantum field theory. You will still find classic developments such as neutron interferometer
experiments, Feynman path integrals, correlation measurements, and Bell's inequality. The
style and treatment of topics is now consistent across chapters.



HIGHLIGHTS OF THE SECOND EDITION

■ Chapter 6, Scattering Theory, is completely reorganized, with a new introduction
based on time-dependent perturbation theory.

■ Schrödinger Wave Equation coverage now includes explicit solutions, including
the linear potential, the simple harmonic oscillator using generating functions, and the
derivation of spherical harmonics.

■ SO(4) symmetry discussion has been added to the early chapters, along with its
application to solving the hydrogen atom and approximation techniques based on
extreme time dependences.

■ Chapter 7, Identical Particles, has been expanded to include the technique of
second quantization and its application to electrons in solids and the electromagnetic
field.



Please visit us at www.pearsonhighered.com for more information. To order any of our products, contact our
customer service department at (800) 824-7799, (201) 767-5021 outside of the U.S., or visit your campus bookstore.






Addison-Wesley 

is an imprint of



PEARSON 





MODERN QUANTUM MECHANICS 



Second Edition 







Addison-Wesley 

Boston Columbus Indianapolis 
New York San Francisco Upper Saddle River
Amsterdam Cape Town Dubai London
Madrid Milan Munich Paris Montréal Toronto
Delhi Mexico City São Paulo Sydney
Hong Kong Seoul Singapore Taipei Tokyo 




Publisher: Jim Smith 

Director of Development: Michael Gillespie 
Editorial Manager: Laura Kenney 
Senior Project Editor: Katie Conley 
Editorial Assistant: Dyan Menezes 
Managing Editor: Corinne Benson 
Production Project Manager: Beth Collins 

Production Management, Composition, and Art Creation: Techsetters, Inc. 

Copyeditor: Connie Day 

Cover Designer: Blake Kim; Seventeenth Street Studios 

Photo Editor: Donna Kalal 

Manufacturing Buyer: Jeff Sargent 

Senior Marketing Manager: Kerry Chapman 

Cover Photo Illustration: Blake Kim 

Credits and acknowledgments borrowed from other sources and reproduced, with 
permission, in this textbook appear on the appropriate page within the text. 

Copyright © 1994, 2011 Pearson Education, Inc., publishing as Addison-Wesley, 1301 
Sansome Street, San Francisco, CA 94111. All rights reserved. Manufactured in the 
United States of America. This publication is protected by Copyright and permission 
should be obtained from the publisher prior to any prohibited reproduction, storage in a 
retrieval system, or transmission in any form or by any means, electronic, mechanical, 
photocopying, recording, or likewise. To obtain permission(s) to use material from this 
work, please submit a written request to Pearson Education, Inc., Permissions
Department, 1900 E. Lake Ave., Glenview, IL 60025. For information regarding 
permissions, call (847) 486-2635. 

Many of the designations used by manufacturers and sellers to distinguish their products
are claimed as trademarks. Where those designations appear in this book, and the 
publisher was aware of a trademark claim, the designations have been printed in initial 
caps or all caps. 

Library of Congress Cataloging-in-Publication Data 

Sakurai, J. J. (Jun John), 1933-1982. 

Modern quantum mechanics. - 2nd ed. / J.J. Sakurai, Jim Napolitano.
p. cm. 

ISBN 978-0-8053-8291-4 (alk. paper) 

1. Quantum theory-Textbooks. I. Napolitano, Jim. II. Title. 

QC174.12.S25 2011 
530.12-dc22 

2010022349 



ISBN 10: 0-8053-8291-7; ISBN 13: 978-0-8053-8291-4 
123456789 10— CRK— 14 13 12 11 10 

Addison-Wesley 

is an imprint of 



PEARSON 



www.pearsonhighered.com 




Contents

Foreword to the First Edition
Preface to the Revised Edition
Preface to the Second Edition
In Memoriam

1 ■ Fundamental Concepts
1.1 The Stern-Gerlach Experiment
1.2 Kets, Bras, and Operators
1.3 Base Kets and Matrix Representations
1.4 Measurements, Observables, and the Uncertainty Relations
1.5 Change of Basis
1.6 Position, Momentum, and Translation
1.7 Wave Functions in Position and Momentum Space

2 ■ Quantum Dynamics
2.1 Time-Evolution and the Schrödinger Equation
2.2 The Schrödinger Versus the Heisenberg Picture
2.3 Simple Harmonic Oscillator
2.4 Schrödinger’s Wave Equation
2.5 Elementary Solutions to Schrödinger’s Wave Equation
2.6 Propagators and Feynman Path Integrals
2.7 Potentials and Gauge Transformations

3 ■ Theory of Angular Momentum
3.1 Rotations and Angular-Momentum Commutation Relations
3.2 Spin 1/2 Systems and Finite Rotations
3.3 SO(3), SU(2), and Euler Rotations
3.4 Density Operators and Pure Versus Mixed Ensembles
3.5 Eigenvalues and Eigenstates of Angular Momentum
3.6 Orbital Angular Momentum
3.7 Schrödinger’s Equation for Central Potentials
3.8 Addition of Angular Momenta
3.9 Schwinger’s Oscillator Model of Angular Momentum
3.10 Spin Correlation Measurements and Bell’s Inequality
3.11 Tensor Operators

4 ■ Symmetry in Quantum Mechanics
4.1 Symmetries, Conservation Laws, and Degeneracies
4.2 Discrete Symmetries, Parity, or Space Inversion
4.3 Lattice Translation as a Discrete Symmetry
4.4 The Time-Reversal Discrete Symmetry

5 ■ Approximation Methods
5.1 Time-Independent Perturbation Theory: Nondegenerate Case
5.2 Time-Independent Perturbation Theory: The Degenerate Case
5.3 Hydrogen-Like Atoms: Fine Structure and the Zeeman Effect
5.4 Variational Methods
5.5 Time-Dependent Potentials: The Interaction Picture
5.6 Hamiltonians with Extreme Time Dependence
5.7 Time-Dependent Perturbation Theory
5.8 Applications to Interactions with the Classical Radiation Field
5.9 Energy Shift and Decay Width

6 ■ Scattering Theory
6.1 Scattering as a Time-Dependent Perturbation
6.2 The Scattering Amplitude
6.3 The Born Approximation
6.4 Phase Shifts and Partial Waves
6.5 Eikonal Approximation
6.6 Low-Energy Scattering and Bound States
6.7 Resonance Scattering
6.8 Symmetry Considerations in Scattering
6.9 Inelastic Electron-Atom Scattering

7 ■ Identical Particles
7.1 Permutation Symmetry
7.2 Symmetrization Postulate
7.3 Two-Electron System
7.4 The Helium Atom
7.5 Multiparticle States
7.6 Quantization of the Electromagnetic Field

8 ■ Relativistic Quantum Mechanics
8.1 Paths to Relativistic Quantum Mechanics
8.2 The Dirac Equation
8.3 Symmetries of the Dirac Equation
8.4 Solving with a Central Potential
8.5 Relativistic Quantum Field Theory

A ■ Electromagnetic Units
A.1 Coulomb’s Law, Charge, and Current
A.2 Converting Between Systems

B ■ Brief Summary of Elementary Solutions to Schrödinger’s Wave Equation
B.1 Free Particles (V = 0)
B.2 Piecewise Constant Potentials in One Dimension
B.3 Transmission-Reflection Problems
B.4 Simple Harmonic Oscillator
B.5 The Central Force Problem [Spherically Symmetrical Potential V = V(r)]
B.6 Hydrogen Atom

C ■ Proof of the Angular-Momentum Addition Rule Given by Equation (3.8.38)

Bibliography

Index




Foreword to the First Edition 



J. J. Sakurai was always a very welcome guest here at CERN, for he was one of 
those rare theorists to whom the experimental facts are even more interesting than 
the theoretical game itself. Nevertheless, he delighted in theoretical physics and 
in its teaching, a subject on which he held strong opinions. He thought that much 
theoretical physics teaching was both too narrow and too remote from application: 

“. . . we see a number of sophisticated, yet uneducated, theoreticians who are conversant
in the LSZ formalism of the Heisenberg field operators, but do not know
why an excited atom radiates, or are ignorant of the quantum theoretic derivation 
of Rayleigh’s law that accounts for the blueness of the sky.” And he insisted that 
the student must be able to use what has been taught: “The reader who has read 
the book but cannot do the exercises has learned nothing.” 

He put these principles to work in his fine book Advanced Quantum Mechanics 
(1967) and in Invariance Principles and Elementary Particles (1964), both of
which have been very much used in the CERN library. This new book, Modern 
Quantum Mechanics, should be used even more, by a larger and less specialized 
group. The book combines breadth of interest with a thorough practicality. Its 
readers will find here what they need to know, with a sustained and successful 
effort to make it intelligible. 

J. J. Sakurai’s sudden death on November 1, 1982 left this book unfinished. 
Reinhold Bertlmann and I helped Mrs. Sakurai sort out her husband’s papers at 
CERN. Among them we found a rough, handwritten version of most of the book 
and a large collection of exercises. Though only three chapters had been completely
finished, it was clear that the bulk of the creative work had been done. It
was also clear that much work remained to fill in gaps, polish the writing, and put 
the manuscript in order. 

That the book is now finished is due to the determination of Noriko Sakurai 
and the dedication of San Fu Tuan. Upon her husband’s death, Mrs. Sakurai re- 
solved immediately that his last effort should not go to waste. With great courage 
and dignity she became the driving force behind the project, overcoming all ob- 
stacles and setting the high standards to be maintained. San Fu Tuan willingly 
gave his time and energy to the editing and completion of Sakurai’s work. Per- 
haps only others close to the hectic field of high-energy theoretical physics can 
fully appreciate the sacrifice involved. 

For me personally, J. J. had long been far more than just a particularly dis- 
tinguished colleague. It saddens me that we will never again laugh together at 
physics and physicists and life in general, and that he will not see the success of 
his last work. But I am happy that it has been brought to fruition. 

John S. Bell 
CERN, Geneva 







Preface to the Revised Edition 



Since 1989 the editor has enthusiastically pursued a revised edition of Modern
Quantum Mechanics by his late great friend J. J. Sakurai, in order to extend this
text’s usefulness into the twenty-first century. Much consultation took place with 
the panel of Sakurai friends who helped with the original edition, but in particular 
with Professor Yasuo Hara of Tsukuba University and Professor Akio Sakurai of 
Kyoto Sangyo University in Japan. 

This book is intended for the first-year graduate student who has studied quantum
mechanics at the junior or senior level. It does not provide an introduction
to quantum mechanics for the beginner. The reader should have had some expe- 
rience in solving time-dependent and time-independent wave equations. A famil- 
iarity with the time evolution of the Gaussian wave packet in a force-free region is 
assumed, as is the ability to solve one-dimensional transmission-reflection prob- 
lems. Some of the general properties of the energy eigenfunctions and the energy 
eigenvalues should also be known to the student who uses this text. 

The major motivation for this project is to revise the main text. There are three 
important additions and/or changes to the revised edition, which otherwise pre- 
serves the original version unchanged. These include a reworking of certain por- 
tions of Section 5.2 on time-independent perturbation theory for the degenerate 
case, by Professor Kenneth Johnson of M.I.T., taking into account a subtle point 
that has not been properly treated by a number of texts on quantum mechanics 
in this country. Professor Roger Newton of Indiana University contributed refine- 
ments on lifetime broadening in Stark effect and additional explanations of phase 
shifts at resonances, the optical theorem, and the non-normalizable state. These
appear as “remarks by the editor” or “editor’s note” in the revised edition. Pro- 
fessor Thomas Fulton of the Johns Hopkins University reworked his Coulomb 
scattering contribution (Section 7.13); it now appears as a shorter text portion 
emphasizing the physics, with the mathematical details relegated to Appendix C. 

Though not a major part of the text, some additions were deemed necessary to 
take into account developments in quantum mechanics that have become promi- 
nent since November 1, 1982. To this end, two supplements are included at the 
end of the text. Supplement I is on adiabatic change and geometrical phase (pop- 
ularized by M. V. Berry since 1983) and is actually an English translation of the 
supplement on this subject written by Professor Akio Sakurai for the Japanese version
of Modern Quantum Mechanics (copyright © Yoshioka-Shoten Publishing
of Kyoto). Supplement II on nonexponential decays was written by my colleague 
here, Professor Xerxes Tata, and read over by Professor E. C. G. Sudarshan of 
the University of Texas at Austin. Although nonexponential decays have a long
history theoretically, experimental work on transition rates that tests such decays
indirectly was done only in 1990. Introduction of additional material is of course a
subjective decision on the part of the editor; readers can judge its appropriateness 
for themselves. Thanks to Professor Akio Sakurai, the revised edition has been 
diligently searched to correct misprint errors of the first ten printings of the origi- 
nal edition. My colleague Professor Sandip Pakvasa provided me overall guidance 
and encouragement throughout this process of revision. 

In addition to the acknowledgments above, my former students Li Ping, Shi 
Xiaohong, and Yasunaga Suzuki provided the sounding board for ideas on the 
revised edition when taking my graduate quantum mechanics course at the University
of Hawaii during the spring of 1992. Suzuki provided the initial translation
from Japanese of Supplement I as a course term paper. Dr. Andy Acker provided 
me with computer graphics assistance. The Department of Physics and Astron- 
omy, and particularly the High Energy Physics Group of the University of Hawaii 
at Manoa, again provided both the facilities and a conducive atmosphere for me to 
carry out my editorial task. Finally I wish to express my gratitude to physics (and 
sponsoring) senior editor Stuart Johnson and his editorial assistant Jennifer Dug- 
gan as well as senior production coordinator Amy Willcutt, of Addison-Wesley 
for their encouragement and optimism that the revised edition would indeed 
materialize. 

San Fu Tuan 
Honolulu, Hawaii 




Preface to the Second Edition 



Quantum mechanics fascinates me. It describes a wide variety of phenomena 
based on very few assumptions. It starts with a framework so unlike the differ- 
ential equations of classical physics, yet it contains classical physics within it. It 
provides quantitative predictions for many physical situations, and these predic- 
tions agree with experiments. In short, quantum mechanics is the ultimate basis, 
today, by which we understand the physical world. 

Thus, I was very pleased to be asked to write the next revised edition of Modern
Quantum Mechanics, by J. J. Sakurai. I had taught this material out of this book 
for a few years and found myself very in tune with its presentation. Like many 
other instructors, however, I found some aspects of the book lacking and therefore 
introduced material from other books and from my own background and research. 
My hybrid class notes form the basis for the changes in this new edition. 

Of course, my original proposal was more ambitious than could be realized, 
and it still took much longer than I would have liked. So many excellent sugges- 
tions found their way to me through a number of reviewers, and I wish I had been 
able to incorporate all of them. I am pleased with the result, however, and I have 
tried hard to maintain the spirit of Sakurai’s original manuscript. 

Chapter 1 is essentially unchanged. Some of the figures were updated, and 
reference is made to Chapter 8, where the relativistic origin of the Dirac magnetic 
moment is laid out. 

Material was added to Chapter 2. This includes a new section on elementary 
solutions including the free particle in three dimensions; the simple harmonic 
oscillator in the Schrödinger equation using generating functions; and the linear
potential as a way of introducing Airy functions. The linear potential solution is 
used to feed into the discussion of the WKB approximation, and the eigenvalues 
are compared to an experiment measuring “bouncing neutrons.” Also included 
is a brief discussion of neutrino oscillations as a demonstration of quantum- 
mechanical interference. 

Chapter 3 now includes solutions to Schrödinger’s equation for central potentials.
The general radial equation is presented and is applied to the free particle
in three dimensions with application to the infinite spherical well. We solve the 
isotropic harmonic oscillator and discuss its application to the “nuclear poten- 
tial well.” We also carry through the solution using the Coulomb potential with a 
discussion on degeneracy. Advanced mathematical techniques are emphasized. 

A subsection that has been added to Chapter 4 discusses the symmetry, known 
classically in terms of the Lenz vector, inherent in the Coulomb problem. This
provides an introduction to SO(4) as an extension of an earlier discussion in Chapter 3
on continuous symmetries.

There are two additions to Chapter 5. First, there is a new introduction to 
Section 5.3 that applies perturbation theory to the hydrogen atom in the context of 
relativistic corrections to the kinetic energy. This, along with some modifications 
to the material on spin-orbit interactions, is helpful for comparisons when the 
Dirac equation is applied to the hydrogen atom at the end of the book. 

Second, a new section on Hamiltonians with “extreme” time dependences has 
been added. This includes a brief discussion of the sudden approximation and a 
longer discussion of the adiabatic approximation. The adiabatic approximation is 
then developed into a discussion of Berry’s Phase, including a specific example 
(with experimental verification) in the spin 1/2 system. Some material from the first
supplement for the previous edition has found its way into this section.

The end of the book contains the most significant revisions, including reversed 
ordering of the chapters on Scattering and Identical Particles. This is partly be- 
cause of a strong feeling on my part (and on the part of several reviewers) that the 
material on scattering needed particular attention. Also, at the suggestion of reviewers,
the reader is brought closer to the subject of quantum field theory, both as
an extension of the material on identical particles to include second quantization, 
and with a new chapter on relativistic quantum mechanics. 

Thus, Chapter 6, which now covers scattering in quantum mechanics, has a 
nearly completely rewritten introduction. A time-dependent treatment is used to 
develop the subject. Furthermore, the sections on the scattering amplitude and 
Born approximation are rewritten to follow this new flow. This includes incorporating
what had been a short section on the optical theorem into the treatment
of the scattering amplitude, before moving on to the Born approximation. The 
remaining sections have been edited, combined, and reworked, with some mate- 
rial removed, in an effort to keep what I, and the reviewers, felt were the most 
important pieces of physics from the last edition. 

Chapter 7 has two new sections that contain a significant expansion of the 
existing material on identical particles. (The section on Young tableaux has been 
removed.) Multiparticle states are developed using second quantization, and two
applications are given in some detail. One is the problem of an electron gas in the 
presence of a positively charged uniform background. The other is the canonical 
quantization of the electromagnetic field. 

The treatment of multiparticle quantum states is just one path toward the development
of quantum field theory. The other path involves incorporating special
relativity into quantum mechanics, and this is the subject of Chapter 8. The sub- 
ject is introduced, and the Klein-Gordon equation is taken about as far as I believe 
is reasonable. The Dirac equation is treated in some detail, in more or less stan- 
dard fashion. Finally, the Coulomb problem is solved for the Dirac equation, and 
some comments are offered on the transition to a relativistic quantum field theory. 

The Appendices are reorganized. A new appendix on electromagnetic units is
aimed at the typical student who uses SI units as an undergraduate but is faced 
with Gaussian units in graduate school. 







I am an experimental physicist, and I try to incorporate relevant experimental 
results in my teaching. Some of these have found their way into this edition, most 
often in terms of figures taken mainly from modern publications.

• Figure 1.6 demonstrates the use of a Stern-Gerlach apparatus to analyze the
polarization states of a beam of cesium atoms.

• Spin rotation in terms of the high-precision measurement of g − 2 for the
muon is shown in Figure 2.1. 

• Neutrino oscillations as observed by the KamLAND collaboration are 
shown in Figure 2.2. 

• A lovely experiment demonstrating the quantum energy levels of “bouncing
neutrons,” Figure 2.4, is included to emphasize agreement between the
exact and WKB eigenvalues for the linear potential. 

• Figure 2.10 showing gravitational phase shift appeared in the previous edi- 
tion. 

• I included Figure 3.6, an old standard, to emphasize that the central- 
potential problems are very much applicable to the real world. 

• Although many measurements of parity violation have been carried out in 
the five decades since its discovery, Wu’s original measurement, Figure 4.6, 
remains one of the clearest demonstrations. 

• Berry’s Phase for spin 1/2, measured with ultra-cold neutrons, is demonstrated
in Figure 5.6. 

• Figure 6.6 is a clear example of how one uses scattering data to interpret 
properties of the target. 

• Sometimes, carefully executed experiments point to some problem in the 
predictions, and Figure 7.2 shows what happens when exchange symmetry 
is not included. 

• Quantization of the electromagnetic field is demonstrated by data on the
Casimir effect (Figure 7.9) and in the observation of squeezed light (Fig- 
ure 7.10). 

• Finally, some classic demonstrations of the need for relativistic quantum 
mechanics are shown. Carl Anderson’s original discovery of the positron is 
shown in Figure 8.1. Modern information on details of the energy levels of
the hydrogen atom is included in Figure 8.2. 

In addition, I have included a number of references to experimental work relevant 
to the discussion topic at hand.

My thanks go out to so many people who have helped me with this project. Colleagues
in physics include John Cummings, Stuart Freedman, Joel Giedt, David
Hertzog, Barry Holstein, Bob Jaffe, Joe Levinger, Alan Litke, Kam-Biu Luk, Bob
McKeown, Harry Nelson, Joe Paki, Murray Peshkin, Olivier Pfister, Mike Snow,
John Townsend, San Fu Tuan, David Van Baak, Dirk Walecka, Tony Zee, and also 
the reviewers who saw the various drafts of the manuscript. At Addison-Wesley, 
I have been guided through this process by Adam Black, Katie Conley, Ashley 
Eklund, Deb Greco, Dyan Menezes, and Jim Smith. I am also indebted to John 
Rogosich and Carol Sawyer from Techsetters, Inc., for their technical expertise 
and advice. My apologies to those whose names have slipped my mind as I write 
this acknowledgment. 

In the end, it is my sincere hope that this new edition is true to Sakurai’s 
original vision and has not been weakened significantly by my interloping. 

Jim Napolitano 
Troy, New York 




In Memoriam 



Jun John Sakurai was born in 1933 in Tokyo and came to the United States as 
a high school student in 1949. He studied at Harvard and at Cornell, where he
received his Ph.D. in 1958. He was then appointed assistant professor of physics 
at the University of Chicago and became a full professor in 1964. He stayed at 
Chicago until 1970 when he moved to the University of California at Los Ange- 
les, where he remained until his death. During his lifetime he wrote 119 articles 
on theoretical physics of elementary particles as well as several books and mono- 
graphs on both quantum and particle theory. 

The discipline of theoretical physics has as its principal aim the formulation of 
theoretical descriptions of the physical world that are at once concise and compre- 
hensive. Because nature is subtle and complex, the pursuit of theoretical physics 
requires bold and enthusiastic ventures to the frontiers of newly discovered phe- 
nomena. This is an area in which Sakurai reigned supreme, with his uncanny 
physical insight and intuition and also his ability to explain these phenomena to 
the unsophisticated in illuminating physical terms. One has but to read his very 
lucid textbooks on Invariance Principles and Elementary Particles and Advanced 
Quantum Mechanics, or his reviews and summer school lectures, to appreciate
this. Without exaggeration I could say that much of what I did understand in particle
physics came from these and from his articles and private tutoring.

When Sakurai was still a graduate student, he proposed what is now known as 
the V-A theory of weak interactions, independently of (and simultaneously with) 
Richard Feynman, Murray Gell-Mann, Robert Marshak, and George Sudarshan. 
In 1960 he published in Annals of Physics a prophetic paper, probably his single
most important one. It was concerned with the first serious attempt to construct
a theory of strong interactions based on Abelian and non-Abelian (Yang-Mills)
gauge invariance. This seminal work induced theorists to attempt an understanding
of the mechanisms of mass generation for gauge (vector) fields, now recognized
as the Higgs mechanism. Above all it stimulated the search for a realistic
unification of forces under the gauge principle, since crowned with success in
the celebrated Glashow-Weinberg-Salam unification of weak and electromagnetic
forces. On the phenomenological side, Sakurai pursued and vigorously advocated
the vector mesons dominance model of hadron dynamics. He was the first to discuss
the mixing of ω and φ meson states. Indeed, he made numerous important
contributions to particle physics phenomenology in a much more general sense,
as his heart was always close to experimental activities. 

I knew Jun John for more than 25 years, and I had the greatest admiration not 
only for his immense powers as a theoretical physicist but also for the warmth
and generosity of his spirit. Though a graduate student himself at Cornell during
1957-1958, he took time from his own pioneering research in K-nucleon disper- 
sion relations to help me (via extensive correspondence) with my Ph.D. thesis on 
the same subject at Berkeley. Both Sandip Pakvasa and I were privileged to be 
associated with one of his last papers on weak couplings of heavy quarks, which 
displayed once more his infectious and intuitive style of doing physics. It is of 
course gratifying to us in retrospect that Jun John counted this paper among the 
score of his published works that he particularly enjoyed. 

The physics community suffered a great loss at Jun John Sakurai’s death. The 
personal sense of loss is a severe one for me. Hence I am profoundly thankful 
for the opportunity to edit and complete his manuscript on Modern Quantum 
Mechanics for publication. In my faith no greater gift can be given me than an 
opportunity to show my respect and love for Jun John through meaningful service. 




CHAPTER 1




Fundamental Concepts 



The revolutionary change in our understanding of microscopic phenomena that 
took place during the first 27 years of the twentieth century is unprecedented in 
the history of natural sciences. Not only did we witness severe limitations in the
validity of classical physics, but we found the alternative theory that replaced the
classical physical theories to be far broader in scope and far richer in its range of
applicability. 

The most traditional way to begin a study of quantum mechanics is to follow 
the historical developments — Planck’s radiation law, the Einstein-Debye theory of 
specific heats, the Bohr atom, de Broglie’s matter waves, and so forth — together 
with careful analyses of some key experiments such as the Compton effect, the 
Franck-Hertz experiment, and the Davisson-Germer-Thompson experiment. In 
that way we may come to appreciate how the physicists in the first quarter of the 
twentieth century were forced to abandon, little by little, the cherished concepts 
of classical physics and how, despite earlier false starts and wrong turns, the great
masters — Heisenberg, Schrödinger, and Dirac, among others — finally succeeded
in formulating quantum mechanics as we know it today. 

However, we do not follow the historical approach in this book. Instead, we 
start with an example that illustrates, perhaps more than any other example, the 
inadequacy of classical concepts in a fundamental way. We hope that exposing
readers to a “shock treatment” at the onset will result in their becoming attuned 
to what we might call the “quantum-mechanical way of thinking” at a very early 
stage. 

This different approach is not merely an academic exercise. Our knowledge 
of the physical world comes from making assumptions about nature, formulating 
these assumptions into postulates, deriving predictions from those postulates, and 
testing such predictions against experiment. If experiment does not agree with 
the prediction, then, presumably, the original assumptions were incorrect. Our 
approach emphasizes the fundamental assumptions we make about nature, upon 
which we have come to base all of our physical laws, and which aim to accom- 
modate profoundly quantum-mechanical observations at the outset. 



1.1 ■ THE STERN-GERLACH EXPERIMENT

The example we concentrate on in this section is the Stern-Gerlach experiment,
originally conceived by O. Stern in 1921 and carried out in Frankfurt by him in
collaboration with W. Gerlach in 1922.* This experiment illustrates in a dramatic
manner the necessity for a radical departure from the concepts of classical mechanics.
In the subsequent sections the basic formalism of quantum mechanics is
presented in a somewhat axiomatic manner but always with the example of the
Stern-Gerlach experiment in the back of our minds. In a certain sense, a two-state
system of the Stern-Gerlach type is the least classical, most quantum-mechanical
system. A solid understanding of problems involving two-state systems will turn
out to be rewarding to any serious student of quantum mechanics. It is for this
reason that we refer repeatedly to two-state problems throughout this book.

FIGURE 1.1 The Stern-Gerlach experiment.

Description of the Experiment 

We now present a brief discussion of the Stern-Gerlach experiment, which is discussed
in almost every book on modern physics.† First, silver (Ag) atoms are
heated in an oven. The oven has a small hole through which some of the silver
atoms escape. As shown in Figure 1.1, the beam goes through a collimator and
is then subjected to an inhomogeneous magnetic field produced by a pair of pole
pieces, one of which has a very sharp edge.

We must now work out the effect of the magnetic field on the silver atoms.
For our purpose the following oversimplified model of the silver atom suffices.
The silver atom is made up of a nucleus and 47 electrons, where 46 out of the 47
electrons can be visualized as forming a spherically symmetrical electron cloud
with no net angular momentum. If we ignore the nuclear spin, which is irrelevant
to our discussion, we see that the atom as a whole does have an angular momentum,
which is due solely to the spin (intrinsic as opposed to orbital) angular
momentum of the single 47th (5s) electron. The 47 electrons are attached to the
nucleus, which is $\sim 2 \times 10^5$ times heavier than the electron; as a result, the heavy
atom as a whole possesses a magnetic moment equal to the spin magnetic moment
of the 47th electron. In other words, the magnetic moment $\boldsymbol{\mu}$ of the atom is
proportional to the electron spin $\mathbf{S}$,

$$\boldsymbol{\mu} \propto \mathbf{S}, \tag{1.1.1}$$

where the precise proportionality factor turns out to be $e/m_e c$ ($e < 0$ in this book)
to an accuracy of about 0.2%.

*For an excellent historical discussion of the Stern-Gerlach experiment, see “Stern and Gerlach:
How a Bad Cigar Helped Reorient Atomic Physics,” by Bretislav Friedrich and Dudley
Herschbach, Physics Today, December (2003) 53.

†For an elementary but enlightening discussion of the Stern-Gerlach experiment, see French and
Taylor (1978), pp. 432-38.

Because the interaction energy of the magnetic moment with the magnetic field
is just $-\boldsymbol{\mu}\cdot\mathbf{B}$, the z-component of the force experienced by the atom is given by

$$F_z = \frac{\partial}{\partial z}(\boldsymbol{\mu}\cdot\mathbf{B}) \simeq \mu_z \frac{\partial B_z}{\partial z}, \tag{1.1.2}$$

where we have ignored the components of $\mathbf{B}$ in directions other than the
z-direction. Because the atom as a whole is very heavy, we expect that the classical
concept of trajectory can be legitimately applied, a point that can be justified
using the Heisenberg uncertainty principle to be derived later. With the arrangement
of Figure 1.1, the $\mu_z > 0$ ($S_z < 0$) atom experiences a downward force, while the
$\mu_z < 0$ ($S_z > 0$) atom experiences an upward force. The beam is then expected
to get split according to the values of $\mu_z$. In other words, the SG (Stern-Gerlach)
apparatus “measures” the z-component of $\boldsymbol{\mu}$ or, equivalently, the z-component of
$\mathbf{S}$ up to a proportionality factor.
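As a rough numerical illustration of (1.1.2), the short Python sketch below evaluates the force for the two signs of $\mu_z$. It is not taken from the text: the function name `force_z`, the use of SI units, and the field gradient of $-10^3$ T/m are assumptions chosen only for illustration (the sign of the gradient is picked so that, as in Figure 1.1, $\mu_z > 0$ gives a downward force). The magnitude of $\mu_z$ is assumed to be one Bohr magneton.

```python
# Bohr magneton in SI units (J/T); an assumed magnitude for mu_z.
BOHR_MAGNETON = 9.2740100783e-24


def force_z(mu_z, dBz_dz):
    """Return F_z = mu_z * dB_z/dz, the approximation in (1.1.2)."""
    return mu_z * dBz_dz


if __name__ == "__main__":
    gradient = -1.0e3  # assumed field gradient in T/m (illustrative only)
    for mu_z in (+BOHR_MAGNETON, -BOHR_MAGNETON):
        f = force_z(mu_z, gradient)
        direction = "downward" if f < 0 else "upward"
        print(f"mu_z = {mu_z:+.3e} J/T -> F_z = {f:+.3e} N ({direction})")
```

Only the sign of the result matters for the argument: opposite signs of $\mu_z$ are pushed in opposite directions, which is why the beam splits according to $\mu_z$.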

The atoms in the oven are randomly oriented; there is no preferred direction
for the orientation of $\boldsymbol{\mu}$. If the electron were like a classical spinning object, we
would expect all values of $\mu_z$ to be realized between $|\boldsymbol{\mu}|$ and $-|\boldsymbol{\mu}|$. This would
lead us to expect a continuous bundle of beams coming out of the SG apparatus,
as indicated in Figure 1.1, spread more or less evenly over the expected range.
Instead, what we experimentally observe is more like the situation also shown
in Figure 1.1, where two “spots” are observed, corresponding to one “up” and
one “down” orientation. In other words, the SG apparatus splits the original silver
beam from the oven into two distinct components, a phenomenon referred to in
the early days of quantum theory as “space quantization.” To the extent that $\boldsymbol{\mu}$
can be identified within a proportionality factor with the electron spin $\mathbf{S}$, only two
values of the z-component of $\mathbf{S}$ are observed: $S_z$ up and $S_z$
down, which we call $S_z+$ and $S_z-$. The two possible values of $S_z$ are multiples
of some fundamental unit of angular momentum; numerically it turns out that
$S_z = \hbar/2$ and $-\hbar/2$, where

$$\hbar = 1.0546 \times 10^{-27}\ \text{erg-s} = 6.5822 \times 10^{-16}\ \text{eV-s}. \tag{1.1.3}$$



This “quantization” of the electron spin angular momentum* is the first important 
feature we deduce from the Stern-Gerlach experiment. 

*An understanding of the roots of this quantization lies in the application of relativity to quantum 
mechanics. See Section 8.2 of this book for a discussion. 




FIGURE 1.2 (a) Classical physics prediction for results from the Stern-Gerlach
experiment. The beam should have been spread out vertically, over a distance corresponding
to the range of values of the magnetic moment times the cosine of the orientation angle.
Stern and Gerlach, however, observed the result in (b), namely that only two orientations
of the magnetic moment manifested themselves. These two orientations did not span the
entire expected range.



Figure 1.2a shows the result one would have expected from the experiment. 
According to classical physics, the beam should have spread itself over a vertical 
distance corresponding to the (continuous) range of orientation of the magnetic 
moment. Instead, one observes Figure 1.2b, which is completely at odds with classical
physics. The beam mysteriously splits itself into two parts, one corresponding
to spin “up” and the other to spin “down.” 
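The contrast between the two panels of Figure 1.2 can be sketched numerically. The following Python fragment is only an illustration under stated assumptions: the function names, the sample size, the unit $|\boldsymbol{\mu}| = 1$, and the equal weighting of the two observed outcomes are ours, not the book's. For an isotropic classical distribution, $\mu_z = |\boldsymbol{\mu}| \cos\theta$ with $\cos\theta$ uniform on $[-1, 1]$.

```python
import random


def classical_mu_z(n, mu=1.0, seed=0):
    """Sample mu_z = mu*cos(theta) for isotropically oriented classical moments.

    For an isotropic distribution, cos(theta) is uniform on [-1, 1].
    """
    rng = random.Random(seed)
    return [mu * rng.uniform(-1.0, 1.0) for _ in range(n)]


def observed_mu_z(n, mu=1.0, seed=0):
    """Sample the two-valued outcome seen in the experiment: +mu or -mu.

    Equal weights for the two outcomes are an assumption of this sketch.
    """
    rng = random.Random(seed)
    return [mu * rng.choice((-1.0, 1.0)) for _ in range(n)]


if __name__ == "__main__":
    classical = classical_mu_z(10_000)
    observed = observed_mu_z(10_000)
    print("distinct classical values:", len(set(classical)))
    print("distinct observed values: ", sorted(set(observed)))
```

The classical sample fills the whole interval, whereas the observed sample contains only the two extreme values.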

Of course, there is nothing sacred about the up-down direction or the z-axis. We
could just as well have applied an inhomogeneous field in a horizontal direction,
say in the x-direction, with the beam proceeding in the y-direction. In this manner
we could have separated the beam from the oven into an $S_x+$ component and an
$S_x-$ component.

Sequential Stern-Gerlach Experiments 

Let us now consider a sequential Stern-Gerlach experiment. By this we mean
that the atomic beam goes through two or more SG apparatuses in sequence. The
first arrangement we consider is relatively straightforward. We subject the beam
coming out of the oven to the arrangement shown in Figure 1.3a, where SGz
stands for an apparatus with the inhomogeneous magnetic field in the z-direction,
as usual. We then block the $S_z-$ component coming out of the first SGz apparatus
and let the remaining $S_z+$ component be subjected to another SGz apparatus. This
time there is only one beam component coming out of the second apparatus: just
the $S_z+$ component. This is perhaps not so surprising; after all, if the atom spins
are up, they are expected to remain so, short of any external field that rotates the
spins between the first and the second SGz apparatuses.

A little more interesting is the arrangement shown in Figure 1.3b. Here the
first SG apparatus is the same as before, but the second one (SGx) has an
inhomogeneous magnetic field in the x-direction. The $S_z+$ beam that enters the second
apparatus (SGx) is now split into two components, an $S_x+$ component and an
$S_x-$ component, with equal intensities. How can we explain this? Does it mean
that 50% of the atoms in the $S_z+$ beam coming out of the first apparatus (SGz)
are made up of atoms characterized by both $S_z+$ and $S_x+$, while the remaining
50% have both $S_z+$ and $S_x-$? It turns out that such a picture runs into difficulty,
as we will see below.

FIGURE 1.3 Sequential Stern-Gerlach experiments.

We now consider a third step, the arrangement shown in Figure 1.3c, which
most dramatically illustrates the peculiarities of quantum-mechanical systems.
This time we add to the arrangement of Figure 1.3b yet a third apparatus, of
the SGz type. It is observed experimentally that two components emerge from the
third apparatus, not one; the emerging beams are seen to have both an $S_z+$
component and an $S_z-$ component. This is a complete surprise because after the atoms
emerged from the first apparatus, we made sure that the $S_z-$ component was
completely blocked. How is it possible that the $S_z-$ component, which we thought
we had eliminated earlier, reappears? The model in which the atoms entering the third
apparatus are visualized to have both $S_z+$ and $S_x+$ is clearly unsatisfactory.

This example is often used to illustrate that in quantum mechanics we cannot
determine both $S_z$ and $S_x$ simultaneously. More precisely, we can say that the
selection of the $S_x+$ beam by the second apparatus (SGx) completely destroys
any previous information about $S_z$.

It is amusing to compare this situation with that of a spinning top in classical
mechanics, where the angular momentum

$$\mathbf{L} = I\boldsymbol{\omega} \tag{1.1.4}$$

can be measured by determining the components of the angular-velocity vector
$\boldsymbol{\omega}$. By observing how fast the object is spinning in which direction, we can
determine $\omega_x$, $\omega_y$, and $\omega_z$ simultaneously. The moment of inertia $I$ is computable if we
know the mass density and the geometric shape of the spinning top, so there is no
difficulty in specifying both $L_z$ and $L_x$ in this classical situation.

It is to be clearly understood that the limitation we have encountered in
determining $S_z$ and $S_x$ is not due to the incompetence of the experimentalist. We cannot
make the $S_z-$ component out of the third apparatus in Figure 1.3c disappear by
improving the experimental techniques. The peculiarities of quantum mechanics
are imposed upon us by the experiment itself. The limitation is, in fact, inherent
in microscopic phenomena.

Analogy with Polarization of Light 

Because this situation looks so novel, some analogy with a familiar classical situ- 
ation may be helpful here. To this end we now digress to consider the polarization 
of light waves. This analogy will help us develop a mathematical framework for 
formulating the postulates of quantum mechanics. 

Consider a monochromatic light wave propagating in the z-direction. A
linearly polarized (or plane polarized) light with a polarization vector in the
x-direction, which we call for short an x-polarized light, has a
space-time-dependent electric field oscillating in the x-direction

$$\mathbf{E} = E_0 \hat{\mathbf{x}} \cos(kz - \omega t). \tag{1.1.5}$$

Likewise, we may consider a y-polarized light, also propagating in the z-direction,

$$\mathbf{E} = E_0 \hat{\mathbf{y}} \cos(kz - \omega t). \tag{1.1.6}$$

Polarized light beams of type (1.1.5) or (1.1.6) can be obtained by letting an
unpolarized light beam go through a Polaroid filter. We call a filter that selects only
beams polarized in the x-direction an x-filter. An x-filter, of course, becomes a
y-filter when rotated by 90° about the propagation (z) direction. It is well known that
when we let a light beam go through an x-filter and subsequently let it impinge on
a y-filter, no light beam comes out (provided, of course, that we are dealing with
100% efficient Polaroids); see Figure 1.4a.

The situation is even more interesting if we insert between the x-filter and the
y-filter yet another Polaroid that selects only a beam polarized in the direction,
which we call the x'-direction, that makes an angle of 45° with the x-direction
in the xy-plane; see Figure 1.4b. This time, there is a light beam coming out of
the y-filter despite the fact that right after the beam went through the x-filter it did
not have any polarization component in the y-direction. In other words, once the
x'-filter intervenes and selects the x'-polarized beam, it is immaterial whether the
beam was previously x-polarized. The selection of the x'-polarized beam by the
second Polaroid destroys any previous information on light polarization. Notice
that this situation is quite analogous to the situation that we encountered earlier
with the SG arrangement of Figure 1.3b, provided that the following
correspondence is made:

$$\begin{aligned} S_z\pm\ \text{atoms} &\leftrightarrow x\text{-},\ y\text{-polarized light} \\ S_x\pm\ \text{atoms} &\leftrightarrow x'\text{-},\ y'\text{-polarized light,} \end{aligned} \tag{1.1.7}$$

where the x'- and y'-axes are defined as in Figure 1.5.




FIGURE 1.4 Light beams subjected to Polaroid filters. (a) An x-filter followed by a y-filter. (b) An x-filter, an x'-filter (45° diagonal), and a y-filter.

FIGURE 1.5 Orientations of the x'- and y'-axes.



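Let us examine how we can quantitatively describe the behavior of
45°-polarized beams (x'- and y'-polarized beams) within the framework of classical
electrodynamics. Using Figure 1.5 we obtain

$$\begin{aligned} E_0 \hat{\mathbf{x}}' \cos(kz - \omega t) &= E_0 \left[ \frac{1}{\sqrt{2}} \hat{\mathbf{x}} \cos(kz - \omega t) + \frac{1}{\sqrt{2}} \hat{\mathbf{y}} \cos(kz - \omega t) \right], \\ E_0 \hat{\mathbf{y}}' \cos(kz - \omega t) &= E_0 \left[ -\frac{1}{\sqrt{2}} \hat{\mathbf{x}} \cos(kz - \omega t) + \frac{1}{\sqrt{2}} \hat{\mathbf{y}} \cos(kz - \omega t) \right]. \end{aligned} \tag{1.1.8}$$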













In the triple-filter arrangement of Figure 1.4b, the beam coming out of the first 
Polaroid is an x-polarized beam, which can be regarded as a linear combination 
of an x'-polarized beam and a y'-polarized beam. The second Polaroid selects
the x'-polarized beam, which can in turn be regarded as a linear combination of
an x-polarized and a y-polarized beam. And finally, the third Polaroid selects the 
y-polarized component. 
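The filter sequences of Figure 1.4 can be checked with a few lines of classical vector arithmetic. The Python sketch below is an illustration, not part of the original text: it assumes ideal (100% efficient) filters that simply project the field amplitude onto the filter axis, and the function names are hypothetical. Angles are measured from the x-axis, so the x-, x'-, and y-filters sit at 0°, 45°, and 90°.

```python
import math


def pass_filter(field, angle_deg):
    """Project a 2D field amplitude (E_x, E_y) onto a filter axis.

    The axis makes angle_deg with the x-direction in the xy-plane.
    """
    a = math.radians(angle_deg)
    ax, ay = math.cos(a), math.sin(a)
    amplitude = field[0] * ax + field[1] * ay
    return (amplitude * ax, amplitude * ay)


def intensity(field):
    """Intensity is taken proportional to the squared amplitude."""
    return field[0] ** 2 + field[1] ** 2


def through(field, angles_deg):
    """Send the field through a sequence of ideal filters."""
    for angle in angles_deg:
        field = pass_filter(field, angle)
    return field


if __name__ == "__main__":
    x_polarized = (1.0, 0.0)  # beam right after the x-filter, unit amplitude
    print("x then y:       ", intensity(through(x_polarized, [90])))
    print("x then x' then y:", intensity(through(x_polarized, [45, 90])))
```

With the x'-filter inserted, each projection reduces the amplitude by $1/\sqrt{2}$, so a quarter of the x-polarized intensity emerges from the y-filter; without it, essentially nothing does (up to floating-point rounding).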

Applying correspondence (1.1.7) from the sequential Stern-Gerlach
experiment of Figure 1.3c to the triple-filter experiment of Figure 1.4b suggests that we
might be able to represent the spin state of a silver atom by some kind of vector
in a new kind of two-dimensional vector space, an abstract vector space not to be
confused with the usual two-dimensional (xy) space. Just as $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$ in (1.1.8) are
the base vectors used to decompose the polarization vector $\hat{\mathbf{x}}'$ of the x'-polarized
light, it is reasonable to represent the $S_x+$ state by a vector, which we call a ket in
the Dirac notation to be developed fully in the next section. We denote this vector
by $|S_x;+\rangle$ and write it as a linear combination of two base vectors, $|S_z;+\rangle$ and
$|S_z;-\rangle$, which correspond to the $S_z+$ and the $S_z-$ states, respectively. So we may
conjecture

$$|S_x;+\rangle \stackrel{?}{=} \frac{1}{\sqrt{2}}|S_z;+\rangle + \frac{1}{\sqrt{2}}|S_z;-\rangle \tag{1.1.9a}$$

$$|S_x;-\rangle \stackrel{?}{=} -\frac{1}{\sqrt{2}}|S_z;+\rangle + \frac{1}{\sqrt{2}}|S_z;-\rangle \tag{1.1.9b}$$

in analogy with (1.1.8). Later we will show how to obtain these expressions using
the general formalism of quantum mechanics.

Thus the unblocked component coming out of the second (SGx) apparatus of
Figure 1.3c is to be regarded as a superposition of $S_z+$ and $S_z-$ in the sense of
(1.1.9a). It is for this reason that two components emerge from the third (SGz)
apparatus.
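The sequential experiment of Figure 1.3c can be mimicked with the conjectured kets (1.1.9). The Python sketch below is an illustration under assumptions that go beyond this section: kets are stored as pairs of coefficients in the $|S_z;\pm\rangle$ basis, and the fraction of a beam passing a selection is taken to be the squared modulus of the overlap, a rule the book develops only later. The names `inner` and `fraction` are hypothetical.

```python
import math

S = 1 / math.sqrt(2)

# Kets as (coefficient of |Sz;+>, coefficient of |Sz;->).
SZ_UP, SZ_DOWN = (1.0, 0.0), (0.0, 1.0)
SX_UP, SX_DOWN = (S, S), (-S, S)  # conjectured forms (1.1.9a) and (1.1.9b)


def inner(bra_ket, ket):
    """Overlap of two kets; the first argument is complex-conjugated."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra_ket, ket))


def fraction(selected, incoming):
    """Assumed rule: fraction of the incoming beam found in the selected state."""
    return abs(inner(selected, incoming)) ** 2


if __name__ == "__main__":
    # Figure 1.3c: SGz selects Sz+, SGx selects Sx+, the final SGz analyzes.
    after_sgx = fraction(SX_UP, SZ_UP)
    for name, ket in (("Sz+", SZ_UP), ("Sz-", SZ_DOWN)):
        total = after_sgx * fraction(ket, SX_UP)
        print(name, "fraction of the original Sz+ beam:", total)
```

Under these assumptions both $S_z+$ and $S_z-$ emerge from the third apparatus with equal weight, while two SGz apparatuses in a row (Figure 1.3a) give no $S_z-$ component at all.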

The next question of immediate concern is, How are we going to represent
the $S_y\pm$ states? Symmetry arguments suggest that if we observe an $S_z\pm$ beam
going in the x-direction and subject it to an SGy apparatus, the resulting situation
will be very similar to the case where an $S_z\pm$ beam going in the y-direction is
subjected to an SGx apparatus. The kets for $S_y\pm$ should then be regarded as a
linear combination of $|S_z;\pm\rangle$, but it appears from (1.1.9) that we have already
used up the available possibilities in writing $|S_x;\pm\rangle$. How can our vector space
formalism distinguish $S_y\pm$ states from $S_x\pm$ states?

An analogy with polarized light again rescues us here. This time we consider
a circularly polarized beam of light, which can be obtained by letting a linearly
polarized light pass through a quarter-wave plate. When we pass such a
circularly polarized light through an x-filter or a y-filter, we again obtain either an
x-polarized beam or a y-polarized beam of equal intensity. Yet everybody knows
that the circularly polarized light is totally different from the 45°-linearly
polarized (x'-polarized or y'-polarized) light.

Mathematically, how do we represent a circularly polarized light? A right
circularly polarized light is nothing more than a linear combination of an x-polarized
light and a y-polarized light, where the oscillation of the electric field for the
y-polarized component is 90° out of phase with that of the x-polarized component:*

$$\mathbf{E} = E_0 \left[ \frac{1}{\sqrt{2}} \hat{\mathbf{x}} \cos(kz - \omega t) + \frac{1}{\sqrt{2}} \hat{\mathbf{y}} \cos\left(kz - \omega t + \frac{\pi}{2}\right) \right]. \tag{1.1.10}$$

It is more elegant to use complex notation by introducing $\boldsymbol{\epsilon}$ as follows:

$$\mathrm{Re}(\boldsymbol{\epsilon}) = \mathbf{E}/E_0. \tag{1.1.11}$$

For a right circularly polarized light, we can then write

$$\boldsymbol{\epsilon} = \left[ \frac{1}{\sqrt{2}} \hat{\mathbf{x}}\, e^{i(kz - \omega t)} + \frac{i}{\sqrt{2}} \hat{\mathbf{y}}\, e^{i(kz - \omega t)} \right], \tag{1.1.12}$$

where we have used $i = e^{i\pi/2}$.
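A quick numerical check that the complex form (1.1.12) reproduces the real field (1.1.10) through (1.1.11) is sketched below in Python. The function names are hypothetical, $E_0$ is set to 1, and the single argument `phase` stands for $kz - \omega t$; these are conveniences of the sketch, not notation from the text.

```python
import cmath
import math

S = 1 / math.sqrt(2)


def real_field(phase):
    """(1.1.10) with E_0 = 1: returns (E_x, E_y) at phase = kz - wt."""
    return (S * math.cos(phase), S * math.cos(phase + math.pi / 2))


def complex_field(phase):
    """(1.1.12): components of epsilon along x-hat and y-hat."""
    wave = cmath.exp(1j * phase)
    return (S * wave, 1j * S * wave)


if __name__ == "__main__":
    for phase in (0.0, 0.3, 1.0, 2.5):
        ex, ey = real_field(phase)
        cx, cy = complex_field(phase)
        # Re(epsilon) should agree with E/E_0 component by component.
        print(phase, abs(cx.real - ex) < 1e-12, abs(cy.real - ey) < 1e-12)
```

The agreement rests on $\mathrm{Re}(i e^{i\phi}) = -\sin\phi = \cos(\phi + \pi/2)$, which is the content of $i = e^{i\pi/2}$.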

We can make the following analogy with the spin states of silver atoms:

$$\begin{aligned} S_y+\ \text{atom} &\leftrightarrow \text{right circularly polarized beam,} \\ S_y-\ \text{atom} &\leftrightarrow \text{left circularly polarized beam.} \end{aligned} \tag{1.1.13}$$

Applying this analogy to (1.1.12), we see that if we are allowed to make the
coefficients preceding base kets complex, there is no difficulty in accommodating
the $S_y\pm$ atoms in our vector space formalism:

$$|S_y;\pm\rangle \stackrel{?}{=} \frac{1}{\sqrt{2}}|S_z;+\rangle \pm \frac{i}{\sqrt{2}}|S_z;-\rangle, \tag{1.1.14}$$

which are obviously different from (1.1.9). We thus see that the two-dimensional
vector space needed to describe the spin states of silver atoms must be a complex
vector space; an arbitrary vector in the vector space is written as a linear
combination of the base vectors $|S_z;\pm\rangle$ with, in general, complex coefficients. The fact
that the necessity of complex numbers is already apparent in such an elementary
example is rather remarkable.
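To see concretely how complex coefficients make room for the $S_y\pm$ states, the Python sketch below compares the kets (1.1.14) with (1.1.9). As before, this is only an illustration: kets are stored as coefficient pairs in the $|S_z;\pm\rangle$ basis, and the squared modulus of an overlap is assumed to give the relative intensity, a rule developed later in the book. The helper names are hypothetical.

```python
import math

S = 1 / math.sqrt(2)

# Kets as (coefficient of |Sz;+>, coefficient of |Sz;->).
SZ_UP, SZ_DOWN = (1.0, 0.0), (0.0, 1.0)
SX_UP, SX_DOWN = (S, S), (-S, S)              # (1.1.9a), (1.1.9b)
SY_UP, SY_DOWN = (S, 1j * S), (S, -1j * S)    # (1.1.14)


def inner(bra_ket, ket):
    """Overlap of two kets; the first argument is complex-conjugated."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra_ket, ket))


def fraction(selected, incoming):
    """Assumed rule: relative intensity is the squared modulus of the overlap."""
    return abs(inner(selected, incoming)) ** 2


if __name__ == "__main__":
    print("Sy+ analyzed along z:", fraction(SZ_UP, SY_UP), fraction(SZ_DOWN, SY_UP))
    print("Sy+ analyzed along x:", fraction(SX_UP, SY_UP), fraction(SX_DOWN, SY_UP))
    print("|<Sy-|Sy+>|:", abs(inner(SY_DOWN, SY_UP)))
```

Under these assumptions an $S_y+$ beam splits evenly in both an SGz and an SGx apparatus, so it cannot coincide with any of the $S_z\pm$ or $S_x\pm$ states, and $|S_y;+\rangle$ and $|S_y;-\rangle$ have zero overlap with each other.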

The reader must have noted by this time that we have deliberately avoided
talking about photons. In other words, we have completely ignored the quantum
aspect of light; nowhere did we mention the polarization states of individual
photons. The analogy we worked out is between kets in an abstract vector space that
describes the spin states of individual atoms and the polarization vectors of the
classical electromagnetic field. Actually, we could have made the analogy even
more vivid by introducing the photon concept and talking about the probability
of finding a circularly polarized photon in a linearly polarized state, and so forth;
however, that is not needed here. Without doing so, we have already accomplished
the main goal of this section: to introduce the idea that quantum-mechanical states
are to be represented by vectors in an abstract complex vector space.†

*Unfortunately, there is no unanimity in the definition of right versus left circularly polarized
light in the literature.

†The reader who is interested in grasping the basic concepts of quantum mechanics through a
careful study of photon polarization may find Chapter 1 of Baym (1969) extremely illuminating.




FIGURE 1.6 A modern Stern-Gerlach apparatus, used to separate spin states of atomic
cesium, taken from F. Lison et al., Phys. Rev. A 61 (1999) 013405. The apparatus is
shown on the left, while the data show the nine different projections for the spin-four
atom, (a) before and (b) after optical pumping is used to populate only extreme spin
projections. The spin quantum number $F = 4$ is a coupling between the outermost electron
in the atom and the nuclear spin $I = 7/2$.



Finally, before outlining the mathematical formalism of quantum mechanics,
we remark that the physics of a Stern-Gerlach apparatus is of far more than simply
academic interest. The ability to separate spin states of atoms has tremendous
practical interest as well. Figure 1.6 shows the use of the Stern-Gerlach technique
to analyze the result of spin manipulation in an atomic beam of cesium atoms.
The only stable isotope, $^{133}$Cs, of this alkali atom has a nuclear spin $I = 7/2$,
and the experiment sorts out the $F = 4$ hyperfine magnetic substate, giving nine
spin orientations. This is only one of many examples where this once mysterious
effect is used for practical devices. Of course, all of these uses only go to firmly
establish this effect, as well as the quantum-mechanical principles that we will
now present and further develop.



1.2 ■ KETS, BRAS, AND OPERATORS 

In the preceding section we showed how analyses of the Stern-Gerlach
experiment lead us to consider a complex vector space. In this and the following section
we formulate the basic mathematics of vector spaces as used in quantum
mechanics. Our notation throughout this book is the bra and ket notation developed by
P. A. M. Dirac. The theory of linear vector spaces had, of course, been known to
mathematicians prior to the birth of quantum mechanics, but Dirac’s way of
introducing vector spaces has many advantages, especially from the physicist’s point
of view.

Ket Space 

We consider a complex vector space whose dimensionality is specified according
to the nature of a physical system under consideration. In Stern-Gerlach-type
experiments where the only quantum-mechanical degree of freedom is the spin
of an atom, the dimensionality is determined by the number of alternative paths
the atoms can follow when subjected to an SG apparatus; in the case of the silver
atoms of the previous section, the dimensionality is just two, corresponding to the
two possible values $S_z$ can assume.* Later, in Section 1.6, we consider the case
of continuous spectra, for example, the position (coordinate) or momentum of a
particle, where the number of alternatives is nondenumerably infinite, in which
case the vector space in question is known as a Hilbert space after D. Hilbert,
who studied vector spaces in infinite dimensions.

In quantum mechanics a physical state, for example, a silver atom with a
definite spin orientation, is represented by a state vector in a complex vector
space. Following Dirac, we call such a vector a ket and denote it by $|\alpha\rangle$. This
state ket is postulated to contain complete information about the physical state;
everything we are allowed to ask about the state is contained in the ket. Two kets
can be added:

$$|\alpha\rangle + |\beta\rangle = |\gamma\rangle. \tag{1.2.1}$$

The sum $|\gamma\rangle$ is just another ket. If we multiply $|\alpha\rangle$ by a complex number $c$, the
resulting product $c|\alpha\rangle$ is another ket. The number $c$ can stand on the left or on the
right of a ket; it makes no difference:

$$c|\alpha\rangle = |\alpha\rangle c. \tag{1.2.2}$$

In the particular case where $c$ is zero, the resulting ket is said to be a null ket.

One of the physics postulates is that $|\alpha\rangle$ and $c|\alpha\rangle$, with $c \neq 0$, represent the
same physical state. In other words, only the “direction” in vector space is of
significance. Mathematicians may prefer to say that we are here dealing with rays
rather than vectors.

An observable, such as momentum and spin components, can be represented
by an operator, such as $A$, in the vector space in question. Quite generally, an
operator acts on a ket from the left,

$$A \cdot (|\alpha\rangle) = A|\alpha\rangle, \tag{1.2.3}$$

which is yet another ket. There will be more on multiplication operations later.

*For many physical systems the dimension of the state space is denumerably infinite. Although
we will usually indicate a finite number of dimensions, $N$, of the ket space, the results also hold
for denumerably infinite dimensions.







In general, $A|\alpha\rangle$ is not a constant times $|\alpha\rangle$. However, there are particular kets
of importance, known as eigenkets of operator $A$, denoted by

$$|a'\rangle, |a''\rangle, |a'''\rangle, \ldots \tag{1.2.4}$$

with the property

$$A|a'\rangle = a'|a'\rangle, \quad A|a''\rangle = a''|a''\rangle, \ldots \tag{1.2.5}$$

where $a', a'', \ldots$ are just numbers. Notice that applying $A$ to an eigenket just
reproduces the same ket apart from a multiplicative number. The set of numbers
$\{a', a'', a''', \ldots\}$, more compactly denoted by $\{a'\}$, is called the set of
eigenvalues of operator $A$. When it becomes necessary to order eigenvalues in a specific
manner, $\{a^{(1)}, a^{(2)}, a^{(3)}, \ldots\}$ may be used in place of $\{a', a'', a''', \ldots\}$.

The physical state corresponding to an eigenket is called an eigenstate. In
the simplest case of spin $\frac{1}{2}$ systems, the eigenvalue-eigenket relation (1.2.5) is
expressed as

$$S_z|S_z;+\rangle = \frac{\hbar}{2}|S_z;+\rangle, \quad S_z|S_z;-\rangle = -\frac{\hbar}{2}|S_z;-\rangle, \tag{1.2.6}$$

where $|S_z;\pm\rangle$ are eigenkets of operator $S_z$ with eigenvalues $\pm\hbar/2$. Here we could
have used just $|\hbar/2\rangle$ for $|S_z;+\rangle$ in conformity with the notation $|a'\rangle$, where an
eigenket is labeled by its eigenvalue, but the notation $|S_z;\pm\rangle$, already used in the
previous section, is more convenient here because we also consider eigenkets of
$S_x$:

$$S_x|S_x;\pm\rangle = \pm\frac{\hbar}{2}|S_x;\pm\rangle. \tag{1.2.7}$$
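The eigenvalue relations (1.2.6) and (1.2.7) can be verified numerically once the operators are written as matrices. The Python sketch below assumes the standard $2 \times 2$ matrix forms of $S_z$ and $S_x$ in the $|S_z;\pm\rangle$ basis, which are not derived until later in the book, takes the $S_x$ eigenkets from the conjecture (1.1.9), and works in units where $\hbar = 1$. The helper names `apply` and `is_eigenket` are hypothetical.

```python
import math

HBAR = 1.0  # units with hbar = 1 (a convenience of this sketch)
S = 1 / math.sqrt(2)

# Assumed matrix forms in the basis (|Sz;+>, |Sz;->).
SZ_OP = ((HBAR / 2, 0.0), (0.0, -HBAR / 2))
SX_OP = ((0.0, HBAR / 2), (HBAR / 2, 0.0))

SZ_UP, SZ_DOWN = (1.0, 0.0), (0.0, 1.0)
SX_UP, SX_DOWN = (S, S), (-S, S)  # from (1.1.9a), (1.1.9b)


def apply(op, ket):
    """Act with a matrix operator on a ket given as a coefficient tuple."""
    return tuple(
        sum(op[i][j] * ket[j] for j in range(len(ket))) for i in range(len(op))
    )


def is_eigenket(op, ket, eigenvalue, tol=1e-12):
    """Check op|ket> = eigenvalue * |ket> component by component."""
    return all(
        abs(out - eigenvalue * c) <= tol for out, c in zip(apply(op, ket), ket)
    )


if __name__ == "__main__":
    print(is_eigenket(SZ_OP, SZ_UP, +HBAR / 2), is_eigenket(SZ_OP, SZ_DOWN, -HBAR / 2))
    print(is_eigenket(SX_OP, SX_UP, +HBAR / 2), is_eigenket(SX_OP, SX_DOWN, -HBAR / 2))
    print(is_eigenket(SZ_OP, SX_UP, +HBAR / 2))  # Sx+ is not an eigenket of Sz
```

The last line illustrates the opening remark of this passage: applying $S_z$ to $|S_x;+\rangle$ does not give a constant times $|S_x;+\rangle$.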

We remarked earlier that the dimensionality of the vector space is determined
by the number of alternatives in Stern-Gerlach-type experiments. More formally,
we are concerned with an $N$-dimensional vector space spanned by the $N$ eigenkets
of observable $A$. Any arbitrary ket $|\alpha\rangle$ can be written as

$$|\alpha\rangle = \sum_{a'} c_{a'} |a'\rangle \tag{1.2.8}$$

with $a', a'', \ldots$ up to $a^{(N)}$, where $c_{a'}$ is a complex coefficient. The question of the
uniqueness of such an expansion will be postponed until we prove the
orthogonality of eigenkets.

Bra Space and Inner Products

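The vector space we have been dealing with is a ket space. We now introduce the
notion of a bra space, a vector space “dual to” the ket space. We postulate that
corresponding to every ket $|\alpha\rangle$ there exists a bra, denoted by $\langle\alpha|$, in this dual, or
bra, space. The bra space is spanned by eigenbras $\{\langle a'|\}$, which correspond to the
eigenkets $\{|a'\rangle\}$. There is a one-to-one correspondence between a ket space and a
bra space:

$$\begin{aligned} |\alpha\rangle &\stackrel{\text{DC}}{\leftrightarrow} \langle\alpha| \\ |a'\rangle, |a''\rangle, \ldots &\stackrel{\text{DC}}{\leftrightarrow} \langle a'|, \langle a''|, \ldots \\ |\alpha\rangle + |\beta\rangle &\stackrel{\text{DC}}{\leftrightarrow} \langle\alpha| + \langle\beta|, \end{aligned} \tag{1.2.9}$$

where DC stands for dual correspondence. Roughly speaking, we can regard the
bra space as some kind of mirror image of the ket space.

The bra dual to $c|\alpha\rangle$ is postulated to be $c^*\langle\alpha|$, not $c\langle\alpha|$, which is a very
important point. More generally, we have

$$c_\alpha|\alpha\rangle + c_\beta|\beta\rangle \stackrel{\text{DC}}{\leftrightarrow} c_\alpha^*\langle\alpha| + c_\beta^*\langle\beta|. \tag{1.2.10}$$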

We now define the inner product of a bra and a ket.* The product is written
as a bra standing on the left and a ket standing on the right; for example,

$$\underbrace{\langle\beta|\alpha\rangle}_{\text{bra (c) ket}} = (\langle\beta|) \cdot (|\alpha\rangle). \tag{1.2.11}$$

This product is, in general, a complex number. Notice that in forming an inner
product, we always take one vector from the bra space and one vector from the
ket space.

We postulate two fundamental properties of inner products. First,

$$\langle\beta|\alpha\rangle = \langle\alpha|\beta\rangle^*. \tag{1.2.12}$$

In other words, $\langle\beta|\alpha\rangle$ and $\langle\alpha|\beta\rangle$ are complex conjugates of each other. Notice
that even though the inner product is, in some sense, analogous to the familiar
scalar product $\mathbf{a}\cdot\mathbf{b}$, $\langle\beta|\alpha\rangle$ must be clearly distinguished from $\langle\alpha|\beta\rangle$; the analogous
distinction is not needed in real vector space because $\mathbf{a}\cdot\mathbf{b}$ is equal to $\mathbf{b}\cdot\mathbf{a}$. Using
(1.2.12) we can immediately deduce that $\langle\alpha|\alpha\rangle$ must be a real number. To prove
this just let $\langle\beta| \to \langle\alpha|$.

The second postulate on inner products is

$$\langle\alpha|\alpha\rangle \geq 0, \tag{1.2.13}$$

where the equality sign holds only if $|\alpha\rangle$ is a null ket. This is sometimes known
as the postulate of positive definite metric. From a physicist’s point of view, this
postulate is essential for the probabilistic interpretation of quantum mechanics, as
will become apparent later.†

*In the literature an inner product is often referred to as a scalar product because it is analogous to
$\mathbf{a}\cdot\mathbf{b}$ in Euclidean space; in this book, however, we reserve the term scalar for a quantity invariant
under rotations in the usual three-dimensional space.

†Attempts to abandon this postulate led to physical theories with “indefinite metric.” We shall not
be concerned with such theories in this book.







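Two kets $|\alpha\rangle$ and $|\beta\rangle$ are said to be orthogonal if

$$\langle\alpha|\beta\rangle = 0, \tag{1.2.14}$$

even though in the definition of the inner product, the bra $\langle\alpha|$ appears. The
orthogonality relation (1.2.14) also implies, via (1.2.12),

$$\langle\beta|\alpha\rangle = 0. \tag{1.2.15}$$

Given a ket that is not a null ket, we can form a normalized ket $|\tilde{\alpha}\rangle$, where

$$|\tilde{\alpha}\rangle = \left(\frac{1}{\sqrt{\langle\alpha|\alpha\rangle}}\right)|\alpha\rangle, \tag{1.2.16}$$

with the property

$$\langle\tilde{\alpha}|\tilde{\alpha}\rangle = 1. \tag{1.2.17}$$

Quite generally, $\sqrt{\langle\alpha|\alpha\rangle}$ is known as the norm of $|\alpha\rangle$, analogous to the magnitude
of vector $\sqrt{\mathbf{a}\cdot\mathbf{a}} = |\mathbf{a}|$ in Euclidean vector space. Because $|\alpha\rangle$ and $c|\alpha\rangle$ represent
the same physical state, we might as well require that the kets we use for physical
states be normalized in the sense of (1.2.17).*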
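The postulates (1.2.10), (1.2.12), (1.2.13) and the normalization (1.2.16) can be illustrated with a small Python sketch. It assumes a concrete representation that the text has not yet introduced: a ket is a tuple of complex coefficients, the dual bra is the tuple of complex-conjugated coefficients, and the inner product is the sum of products of corresponding entries. The function names and the sample kets are hypothetical.

```python
import math


def bra(ket):
    """Dual correspondence: complex-conjugate every coefficient."""
    return tuple(complex(c).conjugate() for c in ket)


def inner(beta, alpha):
    """Inner product <beta|alpha> of two kets given as coefficient tuples."""
    return sum(b * a for b, a in zip(bra(beta), alpha))


def scale(c, ket):
    """Multiply a ket by the complex number c."""
    return tuple(c * k for k in ket)


def normalize(ket):
    """Return the normalized ket of (1.2.16); a null ket cannot be normalized."""
    norm = math.sqrt(inner(ket, ket).real)
    if norm == 0.0:
        raise ValueError("a null ket cannot be normalized")
    return tuple(k / norm for k in ket)


if __name__ == "__main__":
    alpha = (1 + 2j, 3 - 1j)
    beta = (2 - 1j, 1j)
    c = 2 - 3j
    print(inner(beta, alpha), inner(alpha, beta).conjugate())   # (1.2.12)
    print(inner(alpha, alpha))                                  # real, positive
    print(bra(scale(c, alpha)) == scale(c.conjugate(), bra(alpha)))  # (1.2.10)
    print(inner(normalize(alpha), normalize(alpha)))            # close to 1
```

In this representation the conjugation in `bra` is what makes $\langle\alpha|\alpha\rangle$ a sum of squared moduli, hence real and non-negative.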



Operators 

As we noted earlier, observables such as momentum and spin components are to
be represented by operators that can act on kets. We can consider a more general
class of operators that act on kets; they will be denoted by $X$, $Y$, and so forth, while
$A$, $B$, and so on will be used for a restrictive class of operators that correspond to
observables.

An operator acts on a ket from the left side,

$$X \cdot (|\alpha\rangle) = X|\alpha\rangle, \tag{1.2.18}$$

and the resulting product is another ket. Operators $X$ and $Y$ are said to be equal,

$$X = Y, \tag{1.2.19}$$

if

$$X|\alpha\rangle = Y|\alpha\rangle \tag{1.2.20}$$

for an arbitrary ket in the ket space in question. Operator $X$ is said to be the null
operator if, for any arbitrary ket $|\alpha\rangle$, we have

$$X|\alpha\rangle = 0. \tag{1.2.21}$$

*For eigenkets of observables with continuous spectra, different normalization conventions will
be used; see Section 1.6.







Operators can be added; addition operations are commutative and associative:

$$X + Y = Y + X, \tag{1.2.21a}$$

$$X + (Y + Z) = (X + Y) + Z. \tag{1.2.21b}$$

With the single exception of the time-reversal operator to be considered in Chapter
4, the operators that appear in this book are all linear; that is,

$$X(c_\alpha|\alpha\rangle + c_\beta|\beta\rangle) = c_\alpha X|\alpha\rangle + c_\beta X|\beta\rangle. \tag{1.2.22}$$

An operator $X$ always acts on a bra from the right side

$$(\langle\alpha|) \cdot X = \langle\alpha|X, \tag{1.2.23}$$

and the resulting product is another bra. The ket $X|\alpha\rangle$ and the bra $\langle\alpha|X$ are, in
general, not dual to each other. We define the symbol $X^\dagger$ as

$$X|\alpha\rangle \stackrel{\text{DC}}{\leftrightarrow} \langle\alpha|X^\dagger. \tag{1.2.24}$$

The operator $X^\dagger$ is called the Hermitian adjoint, or simply the adjoint, of $X$. An
operator $X$ is said to be Hermitian if

$$X = X^\dagger. \tag{1.2.25}$$



Multiplication 

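Operators $X$ and $Y$ can be multiplied. Multiplication operations are, in general,
noncommutative; that is,

$$XY \neq YX. \tag{1.2.26}$$

Multiplication operations are, however, associative:

$$X(YZ) = (XY)Z = XYZ. \tag{1.2.27}$$

We also have

$$X(Y|\alpha\rangle) = (XY)|\alpha\rangle = XY|\alpha\rangle, \quad (\langle\beta|X)Y = \langle\beta|(XY) = \langle\beta|XY. \tag{1.2.28}$$

Notice that

$$(XY)^\dagger = Y^\dagger X^\dagger \tag{1.2.29}$$

because

$$XY|\alpha\rangle = X(Y|\alpha\rangle) \stackrel{\text{DC}}{\leftrightarrow} (\langle\alpha|Y^\dagger)X^\dagger = \langle\alpha|Y^\dagger X^\dagger. \tag{1.2.30}$$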
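Relations (1.2.26) and (1.2.29) are easy to check on a concrete pair of operators. The Python sketch below assumes that operators on a two-dimensional ket space are represented by $2 \times 2$ complex matrices and that the adjoint is the complex-conjugate transpose, a representation the book establishes only in the next section. The matrices `X` and `Y` are arbitrary examples, and the helper names are hypothetical.

```python
def matmul(X, Y):
    """Product of two square matrices stored as tuples of rows."""
    n = len(X)
    return tuple(
        tuple(sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n))
        for i in range(n)
    )


def dagger(X):
    """Adjoint, assumed here to be the complex-conjugate transpose."""
    n = len(X)
    return tuple(
        tuple(complex(X[j][i]).conjugate() for j in range(n)) for i in range(n)
    )


# Arbitrary example operators (not taken from the text).
X = ((1, 2j), (0, 1))
Y = ((0, 1), (1, 0))

if __name__ == "__main__":
    print("XY == YX:", matmul(X, Y) == matmul(Y, X))
    print("(XY)^dagger == Y^dagger X^dagger:",
          dagger(matmul(X, Y)) == matmul(dagger(Y), dagger(X)))
    print("(XY)^dagger == X^dagger Y^dagger:",
          dagger(matmul(X, Y)) == matmul(dagger(X), dagger(Y)))
```

For this pair the operators do not commute, and the adjoint of the product matches $Y^\dagger X^\dagger$ but not $X^\dagger Y^\dagger$, so the reversal of order in (1.2.29) matters.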

So far, we have considered the following products: $\langle\beta|\alpha\rangle$, $X|\alpha\rangle$, $\langle\alpha|X$, and $XY$.
Are there other products we are allowed to form? Let us multiply $|\beta\rangle$ and $\langle\alpha|$, in
that order. The resulting product

$$(|\beta\rangle) \cdot (\langle\alpha|) = |\beta\rangle\langle\alpha| \tag{1.2.31}$$

is known as the outer product of $|\beta\rangle$ and $\langle\alpha|$. We will emphasize in a moment
that $|\beta\rangle\langle\alpha|$ is to be regarded as an operator; hence it is fundamentally different
from the inner product $\langle\beta|\alpha\rangle$, which is just a number.

There are also “illegal products.” We have already mentioned that an operator
must stand on the left of a ket or on the right of a bra. In other words, $|\alpha\rangle X$
and $X\langle\alpha|$ are examples of illegal products. They are neither kets, nor bras, nor
operators; they are simply nonsensical. Products such as $|\alpha\rangle|\beta\rangle$ and $\langle\alpha|\langle\beta|$ are
also illegal when $|\alpha\rangle$ and $|\beta\rangle$ ($\langle\alpha|$ and $\langle\beta|$) are ket (bra) vectors belonging to the
same ket (bra) space.*

The Associative Axiom 

As is clear from (1.2.27), multiplication operations among operators are associa- 
tive. Actually the associative property is postulated to hold quite generally as long 
as we are dealing with “legal” multiplications among kets, bras, and operators. 
Dirac calls this important postulate the associative axiom of multiplication. 

To illustrate the power of this axiom, let us first consider an outer product
acting on a ket:

$$(|\beta\rangle\langle\alpha|)\cdot|\gamma\rangle. \tag{1.2.32}$$

Because of the associative axiom, we can regard this equally well as

$$|\beta\rangle\cdot(\langle\alpha|\gamma\rangle), \tag{1.2.33}$$

where $\langle\alpha|\gamma\rangle$ is just a number. Thus the outer product acting on a ket is just another
ket; in other words, $|\beta\rangle\langle\alpha|$ can be regarded as an operator. Because (1.2.32) and
(1.2.33) are equal, we may as well omit the dots and let $|\beta\rangle\langle\alpha|\gamma\rangle$ stand for the
operator $|\beta\rangle\langle\alpha|$ acting on $|\gamma\rangle$ or, equivalently, the number $\langle\alpha|\gamma\rangle$ multiplying $|\beta\rangle$.
(On the other hand, if (1.2.33) is written as $(\langle\alpha|\gamma\rangle)\cdot|\beta\rangle$, we cannot afford to omit
the dot and brackets because the resulting expression would look illegal.) Notice
that the operator $|\beta\rangle\langle\alpha|$ rotates $|\gamma\rangle$ into the direction of $|\beta\rangle$. It is easy to see that if

$$X = |\beta\rangle\langle\alpha|, \tag{1.2.34}$$

then

$$X^\dagger = |\alpha\rangle\langle\beta|, \tag{1.2.35}$$

which is left as an exercise.
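
The two statements above, that (1.2.32) equals (1.2.33) and that the adjoint of $|\beta\rangle\langle\alpha|$ is $|\alpha\rangle\langle\beta|$, can be checked numerically in a small sketch. It assumes kets are stored as lists of complex amplitudes in some fixed orthonormal basis; the amplitudes and the helper names (`inner`, `outer`, `apply`, `dagger`) are illustrative choices, not part of the text, and a numerical check is of course not a substitute for the exercise.

```python
# Kets are lists of complex amplitudes; the bra dual to a ket uses the
# complex conjugates of those amplitudes.

def inner(bra_of, ket):
    """<bra_of|ket>: conjugate the first argument, then sum the products."""
    return sum(b.conjugate() * k for b, k in zip(bra_of, ket))

def outer(ket, bra_of):
    """Matrix of the operator |ket><bra_of|."""
    return [[k * b.conjugate() for b in bra_of] for k in ket]

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

def dagger(m):
    """Hermitian adjoint: complex conjugate of the transpose."""
    return [[m[j][i].conjugate() for j in range(len(m))]
            for i in range(len(m[0]))]

# Illustrative kets (assumed values, not normalized).
alpha = [1 + 1j, 2]
beta = [3j, 1 - 2j]
gamma = [2 - 1j, 1j]

X = outer(beta, alpha)                          # X = |beta><alpha|
lhs = apply(X, gamma)                           # (|beta><alpha|)|gamma>
rhs = [inner(alpha, gamma) * b for b in beta]   # |beta> times <alpha|gamma>

print(lhs == rhs)                        # (1.2.32) agrees with (1.2.33)
print(dagger(X) == outer(alpha, beta))   # X^dagger = |alpha><beta|, (1.2.35)
```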

In a second important illustration of the associative axiom, we note that

$$\underbrace{(\langle\beta|)}_{\text{bra}}\cdot\underbrace{(X|\alpha\rangle)}_{\text{ket}} = \underbrace{(\langle\beta|X)}_{\text{bra}}\cdot\underbrace{(|\alpha\rangle)}_{\text{ket}}. \tag{1.2.36}$$

*Later in the book we will encounter products like $|\alpha\rangle|\beta\rangle$, which are more appropriately written
as $|\alpha\rangle\otimes|\beta\rangle$, but in such cases $|\alpha\rangle$ and $|\beta\rangle$ always refer to kets from different vector spaces. For
instance, the first ket belongs to the vector space for electron spin, the second ket to the vector
space for electron orbital angular momentum; or the first ket lies in the vector space of particle 1,
the second ket in the vector space of particle 2, and so forth.

Because the two sides are equal, we might as well use the more compact notation

$$\langle\beta|X|\alpha\rangle \tag{1.2.37}$$

to stand for either side of (1.2.36). Recall now that $\langle\alpha|X^\dagger$ is the bra that is dual to
$X|\alpha\rangle$, so

$$\begin{aligned}
\langle\beta|X|\alpha\rangle &= \langle\beta|\cdot(X|\alpha\rangle) \\
&= \{(\langle\alpha|X^\dagger)\cdot|\beta\rangle\}^{*} \\
&= \langle\alpha|X^\dagger|\beta\rangle^{*},
\end{aligned} \tag{1.2.38}$$

where, in addition to the associative axiom, we used the fundamental property of
the inner product (1.2.12). For a Hermitian X we have

$$\langle\beta|X|\alpha\rangle = \langle\alpha|X|\beta\rangle^{*}. \tag{1.2.39}$$

1.3 ■ BASE KETS AND MATRIX REPRESENTATIONS

Eigenkets of an Observable

Let us consider the eigenkets and eigenvalues of a Hermitian operator A. We use 
the symbol A, reserved earlier for an observable, because in quantum mechanics 
Hermitian operators of interest quite often turn out to be the operators representing 
some physical observables. 

We begin by stating an important theorem. 



Theorem 1.1. The eigenvalues of a Hermitian operator A are real; the eigenkets 
of A corresponding to different eigenvalues are orthogonal. 



Proof. First, recall that

$$A|a'\rangle = a'|a'\rangle. \tag{1.3.1}$$

Because A is Hermitian, we also have

$$\langle a''|A = {a''}^{*}\langle a''|, \tag{1.3.2}$$

where $a', a'', \ldots$ are eigenvalues of A. If we multiply both sides of (1.3.1) by $\langle a''|$
on the left, multiply both sides of (1.3.2) by $|a'\rangle$ on the right, and subtract, we
obtain

$$(a' - {a''}^{*})\langle a''|a'\rangle = 0. \tag{1.3.3}$$

Now $a'$ and $a''$ can be taken to be either the same or different. Let us first choose
them to be the same; we then deduce the reality condition (the first half of the
theorem)

$$a' = {a'}^{*}, \tag{1.3.4}$$

where we have used the fact that $|a'\rangle$ is not a null ket. Let us now assume $a'$ and $a''$
to be different. Because of the just-proved reality condition, the difference $a' - {a''}^{*}$
that appears in (1.3.3) is equal to $a' - a''$, which cannot vanish, by assumption.
The inner product $\langle a''|a'\rangle$ must then vanish:

$$\langle a''|a'\rangle = 0, \qquad (a' \neq a''), \tag{1.3.5}$$

which proves the orthogonality property (the second half of the theorem).



We expect on physical grounds that an observable has real eigenvalues, a point 
that will become clearer in the next section, where measurements in quantum 
mechanics will be discussed. The theorem just proved guarantees the reality of 
eigenvalues whenever the operator is Hermitian. That is why we talk about Her- 
mitian observables in quantum mechanics. 
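
Theorem 1.1 can be illustrated on the smallest nontrivial case. The sketch below takes an arbitrary 2 × 2 Hermitian matrix (the entries are assumed purely for illustration), uses the closed-form eigenvalues of such a matrix, and checks that the two eigenkets are orthogonal. The eigenvector formula used here is valid only when the off-diagonal element is nonzero, and the whole block is a numerical illustration rather than a proof.

```python
import math

def inner(u, v):
    """<u|v> for kets stored as lists of amplitudes."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

# Illustrative Hermitian matrix: real diagonal, conjugate off-diagonals.
a, d, b = 2.0, -1.0, 1.0 - 2.0j
A = [[a, b], [b.conjugate(), d]]

# Closed-form eigenvalues of a 2x2 Hermitian matrix. The argument of the
# square root is a sum of squares, so both eigenvalues are real.
mean = (a + d) / 2
radius = math.sqrt(((a - d) / 2) ** 2 + abs(b) ** 2)
eigenvalues = [mean + radius, mean - radius]

# For b != 0, (b, lam - a) is an unnormalized eigenket with eigenvalue lam.
eigenkets = [[b, lam - a] for lam in eigenvalues]

for lam, ket in zip(eigenvalues, eigenkets):
    residual = [x - lam * y for x, y in zip(apply(A, ket), ket)]
    print(lam, max(abs(r) for r in residual))   # A|a'> - a'|a'> is ~0

overlap = inner(eigenkets[0], eigenkets[1])
print(abs(overlap))   # <a''|a'> for distinct eigenvalues, zero up to rounding
```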

It is conventional to normalize $|a'\rangle$ so that the $\{|a'\rangle\}$ form an orthonormal set:

$$\langle a''|a'\rangle = \delta_{a''a'}. \tag{1.3.6}$$

We may logically ask, Is this set of eigenkets complete? Because we started our 
discussion by asserting that the whole ket space is spanned by the eigenkets of A, 
the eigenkets of A must form a complete set by construction of our ket space.* 

Eigenkets as Base Kets 

We have seen that the normalized eigenkets of A form a complete orthonormal 
set. An arbitrary ket in the ket space can be expanded in terms of the eigenkets 
of A. In other words, the eigenkets of A are to be used as base kets in much the
same way as a set of mutually orthogonal unit vectors is used as base vectors in 
Euclidean space. 

*The astute reader, already familiar with wave mechanics, may point out that the completeness of
eigenfunctions we use can be proved by applying the Sturm-Liouville theory to the Schrödinger
wave equation. But to “derive” the Schrödinger wave equation from our fundamental postulates,
the completeness of the position eigenkets must be assumed.

Given an arbitrary ket $|\alpha\rangle$ in the ket space spanned by the eigenkets of A, let
us attempt to expand it as follows:

$$|\alpha\rangle = \sum_{a'} c_{a'}|a'\rangle. \tag{1.3.7}$$

Multiplying $\langle a''|$ on the left and using the orthonormality property (1.3.6), we can
immediately find the expansion coefficient,

$$c_{a'} = \langle a'|\alpha\rangle. \tag{1.3.8}$$

In other words, we have

$$|\alpha\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle, \tag{1.3.9}$$

which is analogous to an expansion of a vector $\mathbf{V}$ in (real) Euclidean space:

$$\mathbf{V} = \sum_{i} \hat{\mathbf{e}}_i(\hat{\mathbf{e}}_i\cdot\mathbf{V}), \tag{1.3.10}$$

where $\{\hat{\mathbf{e}}_i\}$ form an orthogonal set of unit vectors. We now recall the associative
axiom of multiplication: $|a'\rangle\langle a'|\alpha\rangle$ can be regarded either as the number
$\langle a'|\alpha\rangle$ multiplying $|a'\rangle$ or, equivalently, as the operator $|a'\rangle\langle a'|$ acting on $|\alpha\rangle$. Because
$|\alpha\rangle$ in (1.3.9) is an arbitrary ket, we must have

$$\sum_{a'} |a'\rangle\langle a'| = 1, \tag{1.3.11}$$

where the 1 on the right-hand side is to be understood as the identity operator.
Equation (1.3.11) is known as the completeness relation or closure.

It is difficult to overestimate the usefulness of (1.3.11). Given a chain of kets,
operators, or bras multiplied in legal orders, we can insert, in any place at our
convenience, the identity operator written in form (1.3.11). Consider, for example,
$\langle\alpha|\alpha\rangle$; by inserting the identity operator between $\langle\alpha|$ and $|\alpha\rangle$, we obtain

$$\begin{aligned}
\langle\alpha|\alpha\rangle &= \langle\alpha|\cdot\Big(\sum_{a'}|a'\rangle\langle a'|\Big)\cdot|\alpha\rangle \\
&= \sum_{a'} |\langle a'|\alpha\rangle|^2.
\end{aligned} \tag{1.3.12}$$

This, incidentally, shows that if $|\alpha\rangle$ is normalized, then the expansion coefficients
in (1.3.7) must satisfy

$$\sum_{a'} |c_{a'}|^2 = \sum_{a'} |\langle a'|\alpha\rangle|^2 = 1. \tag{1.3.13}$$

Let us now look at $|a'\rangle\langle a'|$ that appears in (1.3.11). Because this is an outer
product, it must be an operator. Let it operate on $|\alpha\rangle$:

$$(|a'\rangle\langle a'|)\cdot|\alpha\rangle = |a'\rangle\langle a'|\alpha\rangle = c_{a'}|a'\rangle. \tag{1.3.14}$$

We see that $|a'\rangle\langle a'|$ selects that portion of the ket $|\alpha\rangle$ parallel to $|a'\rangle$, so $|a'\rangle\langle a'|$ is
known as the projection operator along the base ket $|a'\rangle$ and is denoted by $\Lambda_{a'}$:

$$\Lambda_{a'} \equiv |a'\rangle\langle a'|. \tag{1.3.15}$$

The completeness relation (1.3.11) can now be written as

$$\sum_{a'} \Lambda_{a'} = 1. \tag{1.3.16}$$
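
The completeness relation and the action of the projection operators can be made concrete in a two-dimensional ket space. The following sketch assumes a particular orthonormal pair of base kets and a particular normalized ket $|\alpha\rangle$ (both chosen only for illustration) and evaluates (1.3.8), (1.3.13), (1.3.14), and (1.3.16) numerically.

```python
import math

def inner(u, v):
    """<u|v> for kets stored as lists of amplitudes."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def outer(ket, bra_of):
    """Matrix of the operator |ket><bra_of|."""
    return [[k * b.conjugate() for b in bra_of] for k in ket]

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

s = 1 / math.sqrt(2)
basis = [[s, 1j * s], [s, -1j * s]]   # assumed orthonormal base kets |a'>

projectors = [outer(ket, ket) for ket in basis]   # Lambda_a' = |a'><a'|

# Completeness (1.3.16): the projectors sum to the identity matrix.
total = [[sum(p[i][j] for p in projectors) for j in range(2)]
         for i in range(2)]

alpha = [0.6, 0.8j]                               # a normalized ket
coeffs = [inner(ket, alpha) for ket in basis]     # c_a' = <a'|alpha>, (1.3.8)
norm = sum(abs(c) ** 2 for c in coeffs)           # left side of (1.3.13)

# (1.3.14): Lambda_a' |alpha> = c_a' |a'>
projected = apply(projectors[0], alpha)
expected = [coeffs[0] * x for x in basis[0]]

print(total)   # the 2x2 identity, up to rounding
print(norm)    # unity, up to rounding
```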







Matrix Representations 

Having specified the base kets, we now show how to represent an operator, say X,
by a square matrix. First, using (1.3.11) twice, we write the operator X as

$$X = \sum_{a''}\sum_{a'} |a''\rangle\langle a''|X|a'\rangle\langle a'|. \tag{1.3.17}$$

There are altogether $N^2$ numbers of form $\langle a''|X|a'\rangle$, where N is the dimensionality
of the ket space. We may arrange them into an $N \times N$ square matrix such that
the column and row indices appear as follows:

$$\underbrace{\langle a''|}_{\text{row}}\, X\, \underbrace{|a'\rangle}_{\text{column}}. \tag{1.3.18}$$

Explicitly we may write the matrix as

$$X \doteq \begin{pmatrix}
\langle a^{(1)}|X|a^{(1)}\rangle & \langle a^{(1)}|X|a^{(2)}\rangle & \cdots \\
\langle a^{(2)}|X|a^{(1)}\rangle & \langle a^{(2)}|X|a^{(2)}\rangle & \cdots \\
\vdots & \vdots & \ddots
\end{pmatrix}, \tag{1.3.19}$$

where the symbol $\doteq$ stands for “is represented by.”*

Using (1.2.38), we can write

$$\langle a''|X|a'\rangle = \langle a'|X^\dagger|a''\rangle^{*}. \tag{1.3.20}$$

At last, the Hermitian adjoint operation, originally defined by (1.2.24), has been
related to the (perhaps more familiar) concept of complex conjugate transposed.
If an operator B is Hermitian, we have

$$\langle a''|B|a'\rangle = \langle a'|B|a''\rangle^{*}. \tag{1.3.21}$$

The way we arranged $\langle a''|X|a'\rangle$ into a square matrix is in conformity with the
usual rule of matrix multiplication. To see this, just note that the matrix representation
of the operator relation

$$Z = XY \tag{1.3.22}$$

reads

$$\begin{aligned}
\langle a''|Z|a'\rangle &= \langle a''|XY|a'\rangle \\
&= \sum_{a'''} \langle a''|X|a'''\rangle\langle a'''|Y|a'\rangle.
\end{aligned} \tag{1.3.23}$$

Again, all we have done is to insert the identity operator, written in form (1.3.11),
between X and Y!

*We do not use the equality sign here because the particular form of a matrix representation
depends on the particular choice of base kets used. The operator is different from a representation
of the operator just as the actor is different from a poster of the actor.
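
The statement that the arrangement (1.3.18) is compatible with ordinary matrix multiplication can be tested directly. The sketch below builds the matrices of matrix elements $\langle a''|X|a'\rangle$ and $\langle a''|Y|a'\rangle$ in an assumed orthonormal basis and compares their product with the representation of $Z = XY$, as in (1.3.23). The operators are specified by arbitrary illustrative matrices in a fixed reference basis, and the function name `represent` is hypothetical.

```python
import math

def inner(u, v):
    """<u|v> for kets stored as lists of amplitudes."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def apply(op, ket):
    """Operator (matrix in the reference basis) acting on a ket."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

def matmul(a, b):
    """Ordinary matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def represent(op, basis):
    """Matrix with entries <a''|op|a'>: row index a'', column index a'."""
    return [[inner(bra, apply(op, ket)) for ket in basis] for bra in basis]

s = 1 / math.sqrt(2)
basis = [[s, s], [s, -s]]        # assumed orthonormal base kets

# Operators given in the reference basis (assumed values).
X = [[1, 2j], [0, 3]]
Y = [[2, 1], [1j, -1]]

X_rep, Y_rep = represent(X, basis), represent(Y, basis)
Z_rep = represent(matmul(X, Y), basis)   # representation of Z = XY
product = matmul(X_rep, Y_rep)           # right side of (1.3.23)

error = max(abs(Z_rep[i][j] - product[i][j])
            for i in range(2) for j in range(2))
print(error)   # tiny: the representation respects matrix multiplication
```

Changing `basis` to another orthonormal pair changes `X_rep` and `Y_rep` individually, which is the point of the footnote on representations, but leaves the agreement between `Z_rep` and `product` intact.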







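Let us now examine how the ket relation

$$|\gamma\rangle = X|\alpha\rangle \tag{1.3.24}$$

can be represented using our base kets. The expansion coefficients of $|\gamma\rangle$ can be
obtained by multiplying $\langle a'|$ on the left:

$$\begin{aligned}
\langle a'|\gamma\rangle &= \langle a'|X|\alpha\rangle \\
&= \sum_{a''} \langle a'|X|a''\rangle\langle a''|\alpha\rangle.
\end{aligned} \tag{1.3.25}$$

But this can be seen as an application of the rule for multiplying a square matrix
with a column matrix, once the expansion coefficients of $|\alpha\rangle$ and $|\gamma\rangle$ arrange
themselves to form column matrices as follows:

$$|\alpha\rangle \doteq \begin{pmatrix} \langle a^{(1)}|\alpha\rangle \\ \langle a^{(2)}|\alpha\rangle \\ \langle a^{(3)}|\alpha\rangle \\ \vdots \end{pmatrix}, \qquad
|\gamma\rangle \doteq \begin{pmatrix} \langle a^{(1)}|\gamma\rangle \\ \langle a^{(2)}|\gamma\rangle \\ \langle a^{(3)}|\gamma\rangle \\ \vdots \end{pmatrix}. \tag{1.3.26}$$

Likewise, given

$$\langle\gamma| = \langle\alpha|X, \tag{1.3.27}$$

we can regard

$$\langle\gamma|a'\rangle = \sum_{a''} \langle\alpha|a''\rangle\langle a''|X|a'\rangle. \tag{1.3.28}$$

So a bra is represented by a row matrix as follows:

$$\begin{aligned}
\langle\gamma| &\doteq \big(\langle\gamma|a^{(1)}\rangle,\ \langle\gamma|a^{(2)}\rangle,\ \langle\gamma|a^{(3)}\rangle,\ \ldots\big) \\
&= \big(\langle a^{(1)}|\gamma\rangle^{*},\ \langle a^{(2)}|\gamma\rangle^{*},\ \langle a^{(3)}|\gamma\rangle^{*},\ \ldots\big).
\end{aligned} \tag{1.3.29}$$

Note the appearance of complex conjugation when the elements of the column
matrix are written as in (1.3.29). The inner product $\langle\beta|\alpha\rangle$ can be written as the
product of the row matrix representing $\langle\beta|$ with the column matrix representing
$|\alpha\rangle$:

$$\begin{aligned}
\langle\beta|\alpha\rangle &= \sum_{a'} \langle\beta|a'\rangle\langle a'|\alpha\rangle \\
&= \big(\langle a^{(1)}|\beta\rangle^{*},\ \langle a^{(2)}|\beta\rangle^{*},\ \ldots\big)
\begin{pmatrix} \langle a^{(1)}|\alpha\rangle \\ \langle a^{(2)}|\alpha\rangle \\ \vdots \end{pmatrix}.
\end{aligned} \tag{1.3.30}$$

If we multiply the row matrix representing $\langle\alpha|$ with the column matrix representing
$|\beta\rangle$, then we obtain just the complex conjugate of the preceding expression,
which is consistent with the fundamental property of the inner product (1.2.12).
Finally, the matrix representation of the outer product $|\beta\rangle\langle\alpha|$ is easily seen to be

$$|\beta\rangle\langle\alpha| \doteq \begin{pmatrix}
\langle a^{(1)}|\beta\rangle\langle a^{(1)}|\alpha\rangle^{*} & \langle a^{(1)}|\beta\rangle\langle a^{(2)}|\alpha\rangle^{*} & \cdots \\
\langle a^{(2)}|\beta\rangle\langle a^{(1)}|\alpha\rangle^{*} & \langle a^{(2)}|\beta\rangle\langle a^{(2)}|\alpha\rangle^{*} & \cdots \\
\vdots & \vdots & \ddots
\end{pmatrix}. \tag{1.3.31}$$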



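The matrix representation of an observable A becomes particularly simple if
the eigenkets of A themselves are used as the base kets. First, we have

$$A = \sum_{a''}\sum_{a'} |a''\rangle\langle a''|A|a'\rangle\langle a'|. \tag{1.3.32}$$

But the square matrix $\langle a''|A|a'\rangle$ is obviously diagonal,

$$\langle a''|A|a'\rangle = \langle a'|A|a'\rangle\delta_{a'a''} = a'\delta_{a'a''}, \tag{1.3.33}$$

so

$$\begin{aligned}
A &= \sum_{a'} a'|a'\rangle\langle a'| \\
&= \sum_{a'} a'\Lambda_{a'}.
\end{aligned} \tag{1.3.34}$$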



Spin ½ Systems

It is here instructive to consider the special case of spin ½ systems. The base kets
used are $|S_z;\pm\rangle$, which we denote, for brevity, as $|\pm\rangle$. The simplest operator in
the ket space spanned by $|\pm\rangle$ is the identity operator, which, according to (1.3.11),
can be written as

$$1 = |+\rangle\langle +| + |-\rangle\langle -|. \tag{1.3.35}$$

According to (1.3.34), we must be able to write $S_z$ as

$$S_z = (\hbar/2)\big[(|+\rangle\langle +|) - (|-\rangle\langle -|)\big]. \tag{1.3.36}$$

The eigenket-eigenvalue relation

$$S_z|\pm\rangle = \pm(\hbar/2)|\pm\rangle \tag{1.3.37}$$

immediately follows from the orthonormality property of $|\pm\rangle$.

It is also instructive to look at two other operators,

$$S_+ \equiv \hbar|+\rangle\langle -|, \qquad S_- \equiv \hbar|-\rangle\langle +|, \tag{1.3.38}$$

which are both seen to be non-Hermitian. The operator $S_+$, acting on the spin-down
ket $|-\rangle$, turns $|-\rangle$ into the spin-up ket $|+\rangle$ multiplied by $\hbar$. On the other
hand, the spin-up ket $|+\rangle$, when acted upon by $S_+$, becomes a null ket. So the
physical interpretation of $S_+$ is that it raises the spin component by one unit of $\hbar$; if
the spin component cannot be raised any further, we automatically get a null state.
Likewise, $S_-$ can be interpreted as an operator that lowers the spin component by
one unit of $\hbar$. Later we will show that $S_\pm$ can be written as $S_x \pm iS_y$.

In constructing the matrix representations of the angular momentum operators,
it is customary to label the column (row) indices in descending order of angular
momentum components; that is, the first entry corresponds to the maximum angular
momentum component, the second to the next highest, and so forth. In our
particular case of spin ½ systems, we have

$$|+\rangle \doteq \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad |-\rangle \doteq \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \tag{1.3.39a}$$

$$S_z \doteq \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad
S_+ \doteq \hbar\begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad
S_- \doteq \hbar\begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}. \tag{1.3.39b}$$



We will come back to these explicit expressions when we discuss the Pauli two- 
component formalism in Chapter 3. 
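
The explicit matrices for $S_z$ and $S_\pm$ follow mechanically from the outer-product definitions (1.3.36) and (1.3.38) once $|+\rangle$ and $|-\rangle$ are written as column matrices. The sketch below carries this out in units where $\hbar = 1$ (an assumption made only to keep the numbers simple; the constant `HBAR` and the helper names are illustrative).

```python
HBAR = 1.0   # units with hbar = 1 (an assumption for illustration)

def outer(ket, bra_of):
    """Matrix of the operator |ket><bra_of|."""
    return [[k * b.conjugate() for b in bra_of] for k in ket]

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

def dagger(m):
    """Hermitian adjoint: complex conjugate of the transpose."""
    return [[m[j][i].conjugate() for j in range(len(m))]
            for i in range(len(m[0]))]

up, down = [1, 0], [0, 1]   # |+> and |->, ordered with |+> first

# (1.3.36) and (1.3.38), built directly from outer products.
S_z = [[HBAR / 2 * (p - m) for p, m in zip(row_p, row_m)]
       for row_p, row_m in zip(outer(up, up), outer(down, down))]
S_plus = [[HBAR * x for x in row] for row in outer(up, down)]
S_minus = [[HBAR * x for x in row] for row in outer(down, up)]

print(S_z)                    # diagonal, entries +hbar/2 and -hbar/2
print(apply(S_plus, down))    # hbar times |+>
print(apply(S_plus, up))      # the null ket
print(dagger(S_plus) == S_minus)   # S_+ and S_- are adjoints of each other
print(dagger(S_plus) == S_plus)    # so S_+ is not Hermitian
```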



1.4 ■ MEASUREMENTS, OBSERVABLES, AND THE UNCERTAINTY RELATIONS



Measurements 

Having developed the mathematics of ket spaces, we are now in a position to 
discuss the quantum theory of measurement processes. This is not a particularly 
easy subject for beginners, so we first turn to the words of the great master, P. A. 
M. Dirac, for guidance (Dirac 1958, p. 36): “A measurement always causes the 
system to jump into an eigenstate of the dynamical variable that is being mea- 
sured.” What does all this mean? We interpret Dirac’s words as follows: Before 
a measurement of observable A is made, the system is assumed to be represented 
by some linear combination 

$$|\alpha\rangle = \sum_{a'} c_{a'}|a'\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle. \tag{1.4.1}$$

When the measurement is performed, the system is “thrown into” one of the
eigenstates, say $|a'\rangle$, of observable A. In other words,

$$|\alpha\rangle \xrightarrow{A\ \text{measurement}} |a'\rangle. \tag{1.4.2}$$

For example, a silver atom with an arbitrary spin orientation will change into
either $|S_z;+\rangle$ or $|S_z;-\rangle$ when subjected to a SG apparatus of type SG$\hat{z}$. Thus a
measurement usually changes the state. The only exception is when the state is
already in one of the eigenstates of the observable being measured, in which case

$$|a'\rangle \xrightarrow{A\ \text{measurement}} |a'\rangle \tag{1.4.3}$$

with certainty, as will be discussed further. When the measurement causes $|\alpha\rangle$
to change into $|a'\rangle$, it is said that A is measured to be $a'$. It is in this sense that
the result of a measurement yields one of the eigenvalues of the observable being
measured.

Given (1.4.1), which is the state ket of a physical system before the measurement,
we do not know in advance into which of the various $|a'\rangle$'s the system will
be thrown as the result of the measurement. We do postulate, however, that the
probability for jumping into some particular $|a'\rangle$ is given by

$$\text{Probability for } a' = |\langle a'|\alpha\rangle|^2, \tag{1.4.4}$$

provided that $|\alpha\rangle$ is normalized.

Although we have been talking about a single physical system, to determine
probability (1.4.4) empirically, we must consider a great number of measurements
performed on an ensemble (that is, a collection) of identically prepared physical
systems, all characterized by the same ket $|\alpha\rangle$. Such an ensemble is known as
a pure ensemble. (We will say more about ensembles in Chapter 3.) A beam
of silver atoms that survive the first SG$\hat{z}$ apparatus of Figure 1.3 with the $S_z-$
component blocked is an example of a pure ensemble because every member
atom of the ensemble is characterized by $|S_z;+\rangle$.

The probabilistic interpretation (1.4.4) for the squared inner product $|\langle a'|\alpha\rangle|^2$
is one of the fundamental postulates of quantum mechanics, so it cannot be
proved. Let us note, however, that it makes good sense in extreme cases. Suppose
the state ket is $|a'\rangle$ itself even before a measurement is made; then according to
(1.4.4), the probability for getting $a'$ (or, more precisely, for being thrown into
$|a'\rangle$) as the result of the measurement is predicted to be 1, which is just what
we expect. By measuring A once again, we, of course, get $|a'\rangle$ only; quite generally,
repeated measurements of the same observable in succession yield the same
result.* If, on the other hand, we are interested in the probability for the system
initially characterized by $|a'\rangle$ to be thrown into some other eigenket $|a''\rangle$ with
$a'' \neq a'$, then (1.4.4) gives zero because of the orthogonality between $|a'\rangle$ and
$|a''\rangle$. From the point of view of measurement theory, orthogonal kets correspond
to mutually exclusive alternatives; for example, if a spin ½ system is in $|S_z;+\rangle$, it
is not in $|S_z;-\rangle$ with certainty.

Quite generally, the probability for anything must be nonnegative. Further- 
more, the probabilities for the various alternative possibilities must add up to 
unity. Both of these expectations are met by our probability postulate (1.4.4). 
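
A small numerical sketch may help make the probability postulate concrete. It assumes a two-dimensional ket space, a particular orthonormal pair of eigenkets of some observable A, and a particular normalized state ket; all of these values, and the labels `"a1"` and `"a2"`, are hypothetical choices made for illustration.

```python
import math

def inner(u, v):
    """<u|v> for kets stored as lists of amplitudes."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

s = 1 / math.sqrt(2)
# Assumed orthonormal eigenkets |a'> of an observable A.
eigenkets = {"a1": [s, s], "a2": [s, -s]}

alpha = [0.6, 0.8]   # normalized state ket: 0.36 + 0.64 = 1

# Probability postulate (1.4.4): P(a') = |<a'|alpha>|^2
probabilities = {label: abs(inner(ket, alpha)) ** 2
                 for label, ket in eigenkets.items()}

print(probabilities)                 # each value is nonnegative
print(sum(probabilities.values()))   # adds up to unity, within rounding

# Extreme cases: if the state is already |a1>, the outcome a1 is certain
# and the orthogonal alternative a2 never occurs.
certain = abs(inner(eigenkets["a1"], eigenkets["a1"])) ** 2
excluded = abs(inner(eigenkets["a2"], eigenkets["a1"])) ** 2
print(certain, excluded)
```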

*Here successive measurements must be carried out immediately afterward. This point will become
clear when we discuss the time evolution of a state ket in Chapter 2.

We define the expectation value of A taken with respect to state $|\alpha\rangle$ as

$$\langle A\rangle \equiv \langle\alpha|A|\alpha\rangle. \tag{1.4.5}$$

To make sure that we are referring to state $|\alpha\rangle$, the notation $\langle A\rangle_\alpha$ is sometimes
used. Equation (1.4.5) is a definition; however, it agrees with our intuitive notion
of average measured value because it can be written as

$$\begin{aligned}
\langle A\rangle &= \sum_{a'}\sum_{a''} \langle\alpha|a''\rangle\langle a''|A|a'\rangle\langle a'|\alpha\rangle \\
&= \sum_{a'} \underbrace{a'}_{\text{measured value } a'}\ \underbrace{|\langle a'|\alpha\rangle|^2}_{\text{probability for obtaining } a'}.
\end{aligned} \tag{1.4.6}$$

It is very important not to confuse eigenvalues with expectation values. For example,
the expectation value of $S_z$ for spin ½ systems can assume any real value
between $-\hbar/2$ and $+\hbar/2$, say $0.273\hbar$; in contrast, the eigenvalue of $S_z$ assumes
only two values, $\hbar/2$ and $-\hbar/2$.

FIGURE 1.7 Selective measurement. (Labels: incoming $|\alpha\rangle$, outgoing $|a'\rangle$, and $|a''\rangle$ with $a'' \neq a'$.)

To clarify further the meaning of measurements in quantum mechanics, we
introduce the notion of a selective measurement, or filtration. In Section 1.1 we
considered a Stern-Gerlach arrangement where we let only one of the spin components
pass out of the apparatus while we completely blocked the other component.
More generally, we imagine a measurement process with a device that selects only
one of the eigenkets of A, say $|a'\rangle$, and rejects all others; see Figure 1.7. This is
what we mean by a selective measurement; it is also called filtration because only
one of the A eigenkets filters through the ordeal. Mathematically we can say that
such a selective measurement amounts to applying the projection operator $\Lambda_{a'}$ to
$|\alpha\rangle$:

$$\Lambda_{a'}|\alpha\rangle = |a'\rangle\langle a'|\alpha\rangle. \tag{1.4.7}$$
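
The relation between (1.4.5) and (1.4.6), and the action of the filter in (1.4.7), can be illustrated for $S_z$ in a spin ½ system. The sketch below works in units where $\hbar = 1$ and uses an arbitrarily chosen normalized state ket; both are assumptions made for illustration only.

```python
HBAR = 1.0   # units with hbar = 1 (an assumption for illustration)

def inner(u, v):
    """<u|v> for kets stored as lists of amplitudes."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

S_z = [[HBAR / 2, 0], [0, -HBAR / 2]]
eigen = [(HBAR / 2, [1, 0]), (-HBAR / 2, [0, 1])]   # (a', |a'>) pairs

alpha = [0.6, 0.8j]   # normalized state ket (assumed values)

# Expectation value two ways: the definition (1.4.5) and the weighted
# average of eigenvalues (1.4.6).
direct = inner(alpha, apply(S_z, alpha)).real
weighted = sum(a * abs(inner(ket, alpha)) ** 2 for a, ket in eigen)

# Selective measurement (1.4.7): Lambda_a'|alpha> = |a'><a'|alpha>
plus = eigen[0][1]
filtered = [inner(plus, alpha) * x for x in plus]

print(direct, weighted)   # agree, and lie strictly between -hbar/2 and +hbar/2
print(filtered)           # only the |+> component survives the filter
```

With these assumed amplitudes the expectation value is neither of the two eigenvalues, which is the distinction the text draws between eigenvalues and expectation values.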

J. Schwinger has developed a formalism of quantum mechanics based on a
thorough examination of selective measurements. He introduces a measurement
symbol $M(a')$ in the beginning, which is identical to $\Lambda_{a'}$ or $|a'\rangle\langle a'|$ in our notation,
and deduces a number of properties of $M(a')$ (and also of $M(b',a')$ that
amount to $|b'\rangle\langle a'|$) by studying the outcome of various Stern-Gerlach-type experiments.
In this way he motivates the entire mathematics of kets, bras, and operators.
In this book we do not follow Schwinger’s path; the interested reader may
consult Gottfried (1966).

Spin ½ Systems, Once Again

Before proceeding with a general discussion of observables, we once again
consider spin ½ systems. This time we show that the results of sequential
Stern-Gerlach experiments, when combined with the postulates of quantum mechanics
discussed so far, are sufficient to determine not only the $S_{x,y}$ eigenkets,
$|S_x;\pm\rangle$ and $|S_y;\pm\rangle$, but also the operators $S_x$ and $S_y$ themselves.

First, we recall that when the $S_x+$ beam is subjected to an apparatus of type
SG$\hat{z}$, the beam splits into two components with equal intensities. This means that
the probability for the $S_x+$ state to be thrown into $|S_z;\pm\rangle$, simply denoted as $|\pm\rangle$,
is $\tfrac{1}{2}$ each; hence,

$$|\langle +|S_x;+\rangle| = |\langle -|S_x;+\rangle| = \frac{1}{\sqrt{2}}. \tag{1.4.8}$$

We can therefore construct the $S_x+$ ket as follows:

$$|S_x;+\rangle = \frac{1}{\sqrt{2}}|+\rangle + \frac{1}{\sqrt{2}}e^{i\delta_1}|-\rangle, \tag{1.4.9}$$

with $\delta_1$ real. In writing (1.4.9) we have used the fact that the overall phase (common
to both $|+\rangle$ and $|-\rangle$) of a state ket is immaterial; the coefficient of $|+\rangle$ can
be chosen to be real and positive by convention. The $S_x-$ ket must be orthogonal
to the $S_x+$ ket because the $S_x+$ alternative and $S_x-$ alternative are mutually
exclusive. This orthogonality requirement leads to

$$|S_x;-\rangle = \frac{1}{\sqrt{2}}|+\rangle - \frac{1}{\sqrt{2}}e^{i\delta_1}|-\rangle, \tag{1.4.10}$$

where we have, again, chosen the coefficient of $|+\rangle$ to be real and positive by
convention. We can now construct the operator $S_x$ using (1.3.34) as follows:

$$\begin{aligned}
S_x &= \frac{\hbar}{2}\big[(|S_x;+\rangle\langle S_x;+|) - (|S_x;-\rangle\langle S_x;-|)\big] \\
&= \frac{\hbar}{2}\big[e^{-i\delta_1}(|+\rangle\langle -|) + e^{i\delta_1}(|-\rangle\langle +|)\big].
\end{aligned} \tag{1.4.11}$$

Notice that the $S_x$ we have constructed is Hermitian, just as it must be. A similar
argument with $S_x$ replaced by $S_y$ leads to

$$|S_y;\pm\rangle = \frac{1}{\sqrt{2}}|+\rangle \pm \frac{1}{\sqrt{2}}e^{i\delta_2}|-\rangle, \tag{1.4.12}$$

$$S_y = \frac{\hbar}{2}\big[e^{-i\delta_2}(|+\rangle\langle -|) + e^{i\delta_2}(|-\rangle\langle +|)\big]. \tag{1.4.13}$$



Is there any way of determining $\delta_1$ and $\delta_2$? Actually there is one piece of information
we have not yet used. Suppose we have a beam of spin ½ atoms moving in
the z-direction. We can consider a sequential Stern-Gerlach experiment with SG$\hat{x}$
followed by SG$\hat{y}$. The results of such an experiment are completely analogous to
the earlier case leading to (1.4.8):

$$|\langle S_y;\pm|S_x;+\rangle| = |\langle S_y;\pm|S_x;-\rangle| = \frac{1}{\sqrt{2}}, \tag{1.4.14}$$

which is not surprising in view of the invariance of physical systems under rotations.
Inserting (1.4.10) and (1.4.12) into (1.4.14), we obtain

$$\frac{1}{2}\left|1 \pm e^{i(\delta_1-\delta_2)}\right| = \frac{1}{\sqrt{2}}, \tag{1.4.15}$$

which is satisfied only if

$$\delta_2 - \delta_1 = \frac{\pi}{2} \quad\text{or}\quad -\frac{\pi}{2}. \tag{1.4.16}$$

We thus see that the matrix elements of $S_x$ and $S_y$ cannot all be real. If the $S_x$
matrix elements are real, the $S_y$ matrix elements must be purely imaginary (and
vice versa). Just from this extremely simple example, the introduction of complex
numbers is seen to be an essential feature in quantum mechanics. It is convenient
to take the $S_x$ matrix elements to be real* and set $\delta_1 = 0$; if we were to choose
$\delta_1 = \pi$, the positive x-axis would be oriented in the opposite direction. The second
phase angle $\delta_2$ must then be $-\pi/2$ or $\pi/2$. The fact that there is still an ambiguity
of this kind is not surprising. We have not yet specified whether the coordinate
system we are using is right-handed or left-handed; given the x- and z-axes, there
is still a twofold ambiguity in the choice of the positive y-axis. Later we will
discuss angular momentum as a generator of rotations using the right-handed coordinate
system; it can then be shown that $\delta_2 = \pi/2$ is the correct choice.

To summarize, we have

$$|S_x;\pm\rangle = \frac{1}{\sqrt{2}}|+\rangle \pm \frac{1}{\sqrt{2}}|-\rangle, \tag{1.4.17a}$$

$$|S_y;\pm\rangle = \frac{1}{\sqrt{2}}|+\rangle \pm \frac{i}{\sqrt{2}}|-\rangle, \tag{1.4.17b}$$

and

$$S_x = \frac{\hbar}{2}\big[(|+\rangle\langle -|) + (|-\rangle\langle +|)\big], \tag{1.4.18a}$$

$$S_y = \frac{\hbar}{2}\big[-i(|+\rangle\langle -|) + i(|-\rangle\langle +|)\big]. \tag{1.4.18b}$$

The $S_x\pm$ and $S_y\pm$ eigenkets given here are seen to be in agreement with our earlier
guesses (1.1.9) and (1.1.14) based on an analogy with linearly and circularly
polarized light. (Note, in this comparison, that only the relative phase between
the $|+\rangle$ and $|-\rangle$ components is of physical significance.) Furthermore, the non-Hermitian
operators defined by (1.3.38) can now be written as

$$S_\pm = S_x \pm iS_y. \tag{1.4.19}$$

*This can always be done by adjusting arbitrary phase factors in the definition of $|+\rangle$ and $|-\rangle$.
This point will become clearer in Chapter 3, where the behavior of $|\pm\rangle$ under rotations will be
discussed.
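
The construction of $S_x$ and $S_y$ from their eigenkets can be replayed numerically. The sketch below assumes units with $\hbar = 1$, builds the operator $(\hbar/2)(|S;+\rangle\langle S;+| - |S;-\rangle\langle S;-|)$ for a given relative phase $\delta$ as in (1.4.11), and then sets $\delta_1 = 0$ and $\delta_2 = \pi/2$. The function name `spin_operator` and the tolerance in `close` are illustrative choices, not part of the text.

```python
import cmath
import math

HBAR = 1.0   # units with hbar = 1 (an assumption for illustration)

def outer(ket, bra_of):
    """Matrix of the operator |ket><bra_of|."""
    return [[k * b.conjugate() for b in bra_of] for k in ket]

def spin_operator(delta):
    """(hbar/2)(|S;+><S;+| - |S;-><S;-|) for relative phase delta, cf. (1.4.11)."""
    s = 1 / math.sqrt(2)
    plus = [s, s * cmath.exp(1j * delta)]
    minus = [s, -s * cmath.exp(1j * delta)]
    p, m = outer(plus, plus), outer(minus, minus)
    return [[HBAR / 2 * (p[i][j] - m[i][j]) for j in range(2)]
            for i in range(2)]

def close(a, b, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices within a tolerance."""
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(2) for j in range(2))

S_x = spin_operator(0.0)           # delta_1 = 0
S_y = spin_operator(math.pi / 2)   # delta_2 = pi/2

# S_+ and S_- from (1.4.19).
S_plus = [[S_x[i][j] + 1j * S_y[i][j] for j in range(2)] for i in range(2)]
S_minus = [[S_x[i][j] - 1j * S_y[i][j] for j in range(2)] for i in range(2)]

print(close(S_x, [[0, 0.5], [0.5, 0]]))       # matches (1.4.18a)
print(close(S_y, [[0, -0.5j], [0.5j, 0]]))    # matches (1.4.18b)
print(close(S_plus, [[0, 1], [0, 0]]))        # hbar |+><-|, as in (1.3.38)
print(close(S_minus, [[0, 0], [1, 0]]))       # hbar |-><+|, as in (1.3.38)
```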







The operators $S_x$ and $S_y$, together with $S_z$ given earlier, can be readily shown
to satisfy the commutation relations

$$[S_i,S_j] = i\varepsilon_{ijk}\hbar S_k \tag{1.4.20}$$

and the anticommutation relations

$$\{S_i,S_j\} = \tfrac{1}{2}\hbar^2\delta_{ij}, \tag{1.4.21}$$

where the commutator [ , ] and the anticommutator { , } are defined by

$$[A,B] \equiv AB - BA, \tag{1.4.22a}$$

$$\{A,B\} \equiv AB + BA. \tag{1.4.22b}$$

(We make use of the totally antisymmetric symbol $\varepsilon_{ijk}$, which has the value +1
for $\varepsilon_{123}$ and any cyclic permutation of indices; −1 for $\varepsilon_{213}$ and any cyclic permutation
of indices; and 0 when any two indices are the same.) The commutation
relations in (1.4.20) will be recognized as the simplest realization of the angular-momentum
commutation relations, whose significance will be discussed in detail
in Chapter 3. In contrast, the anticommutation relations in (1.4.21) turn out to be
a special property of spin ½ systems.

We can also define the operator $\mathbf{S}\cdot\mathbf{S}$, or $\mathbf{S}^2$ for short, as follows:

$$\mathbf{S}^2 \equiv S_x^2 + S_y^2 + S_z^2. \tag{1.4.23}$$

Because of (1.4.21), this operator turns out to be just a constant multiple of the
identity operator

$$\mathbf{S}^2 = \left(\frac{3}{4}\right)\hbar^2. \tag{1.4.24}$$

We obviously have

$$[\mathbf{S}^2, S_i] = 0. \tag{1.4.25}$$

As will be shown in Chapter 3, for spins higher than ½, $\mathbf{S}^2$ is no longer a multiple
of the identity operator; however, (1.4.25) still holds.
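
The claim that these relations "can be readily shown" is easy to confirm by direct matrix multiplication. The sketch below uses the 2 × 2 matrices corresponding to (1.4.18a), (1.4.18b), and (1.3.36) in units where $\hbar = 1$ (an assumption), with indices 0, 1, 2 standing for x, y, z. The helper names and the closed-form expression used for the antisymmetric symbol are illustrative choices.

```python
HBAR = 1.0   # units with hbar = 1 (an assumption for illustration)
h = HBAR / 2

S = [
    [[0, h], [h, 0]],               # S_x, from (1.4.18a)
    [[0, -1j * h], [1j * h, 0]],    # S_y, from (1.4.18b)
    [[h, 0], [0, -h]],              # S_z, from (1.3.36)
]

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def combine(a, b, sign):
    """AB + sign * BA: commutator for sign = -1, anticommutator for +1."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + sign * ba[i][j] for j in range(2)] for i in range(2)]

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices drawn from 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def rhs_commutator(i, j):
    """i * hbar * sum over k of epsilon_ijk S_k, the right side of (1.4.20)."""
    return [[sum(1j * HBAR * levi_civita(i, j, k) * S[k][a][b]
                 for k in range(3)) for b in range(2)] for a in range(2)]

def rhs_anticommutator(i, j):
    """(hbar^2 / 2) delta_ij times the identity, the right side of (1.4.21)."""
    value = HBAR ** 2 / 2 if i == j else 0
    return [[value, 0], [0, value]]

pairs = [(i, j) for i in range(3) for j in range(3)]
commutators_ok = all(combine(S[i], S[j], -1) == rhs_commutator(i, j)
                     for i, j in pairs)
anticommutators_ok = all(combine(S[i], S[j], +1) == rhs_anticommutator(i, j)
                         for i, j in pairs)

S_squared = [[sum(matmul(s, s)[a][b] for s in S) for b in range(2)]
             for a in range(2)]

print(commutators_ok, anticommutators_ok)
print(S_squared)   # (3/4) hbar^2 times the identity, as in (1.4.24)
```

Exact equality is used here only because every entry is a small multiple of 1/4, which is represented exactly in floating point; for general matrices a tolerance would be needed.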



Compatible Observables 

Returning now to the general formalism, we will discuss compatible versus in- 
compatible observables. Observables A and B are defined to be compatible when 
the corresponding operators commute, 

[A,B] = 0, (1.4.26) 



and to be incompatible when

$$[A,B] \neq 0. \tag{1.4.27}$$

For example, $\mathbf{S}^2$ and $S_z$ are compatible observables, whereas $S_x$ and $S_z$ are
incompatible observables.

Let us first consider the case of compatible observables A and B. As usual, we 
assume that the ket space is spanned by the eigenkets of A. We may also regard the 
same ket space as being spanned by the eigenkets of B. We now ask, How are the 
A eigenkets related to the B eigenkets when A and B are compatible observables? 

Before answering this question we must touch upon a very important point
we have bypassed earlier: the concept of degeneracy. Suppose there are two (or
more) linearly independent eigenkets of A having the same eigenvalue; then the
eigenvalues of the two eigenkets are said to be degenerate. In such a case the notation
$|a'\rangle$ that labels the eigenket by its eigenvalue alone does not give a complete
description; furthermore, we may recall that our earlier theorem on the orthogonality
of different eigenkets was proved under the assumption of no degeneracy.
Even worse, the whole concept that the ket space is spanned by $\{|a'\rangle\}$ appears to
run into difficulty when the dimensionality of the ket space is larger than the number
of distinct eigenvalues of A. Fortunately, in practical applications in quantum
mechanics, it is usually the case that in such a situation the eigenvalues of some
other commuting observable, say B, can be used to label the degenerate eigenkets.

Now we are ready to state an important theorem.



Theorem 1.2. Suppose that A and B are compatible observables, and the eigenvalues
of A are nondegenerate. Then the matrix elements $\langle a''|B|a'\rangle$ are all diagonal.
(Recall here that the matrix elements of A are already diagonal if $\{|a'\rangle\}$ are
used as the base kets.)

Proof. The proof of this important theorem is extremely simple. Using the definition
(1.4.26) of compatible observables, we observe that

$$\langle a''|[A,B]|a'\rangle = (a'' - a')\langle a''|B|a'\rangle = 0. \tag{1.4.28}$$

Therefore, $\langle a''|B|a'\rangle$ must vanish unless $a' = a''$, which proves our assertion.
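
The intermediate step in (1.4.28) can be written out explicitly. Letting A act to the left on $\langle a''|$, using (1.3.2) together with the reality of the eigenvalues, and to the right on $|a'\rangle$, using (1.3.1), one finds:

```latex
\begin{aligned}
\langle a''|[A,B]|a'\rangle
  &= \langle a''|AB|a'\rangle - \langle a''|BA|a'\rangle \\
  &= a''\,\langle a''|B|a'\rangle - a'\,\langle a''|B|a'\rangle \\
  &= (a''-a')\,\langle a''|B|a'\rangle = 0 .
\end{aligned}
```

Because the eigenvalues of A are assumed nondegenerate, $a'' - a'$ is nonzero whenever the two eigenkets differ, so the off-diagonal matrix elements of B are forced to vanish.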



We can write the matrix elements of B as

$$\langle a''|B|a'\rangle = \delta_{a'a''}\langle a'|B|a'\rangle. \tag{1.4.29}$$

So both A and B can be represented by diagonal matrices with the same set of
base kets. Using (1.3.17) and (1.4.29), we can write B as

$$B = \sum_{a''} |a''\rangle\langle a''|B|a''\rangle\langle a''|. \tag{1.4.30}$$

Suppose that this operator acts on an eigenket of A:

$$B|a'\rangle = \sum_{a''} |a''\rangle\langle a''|B|a''\rangle\langle a''|a'\rangle = (\langle a'|B|a'\rangle)|a'\rangle. \tag{1.4.31}$$

But this is nothing other than the eigenvalue equation for the operator B with
eigenvalue

$$b' \equiv \langle a'|B|a'\rangle. \tag{1.4.32}$$

The ket $|a'\rangle$ is therefore a simultaneous eigenket of A and B. Just to be impartial
to both operators, we may use $|a',b'\rangle$ to characterize this simultaneous eigenket.

We have seen that compatible observables have simultaneous eigenkets. Even 
though the proof given is for the case where the A eigenkets are nondegenerate, 
the statement holds even if there is an n-fold degeneracy. That is, 

$$A|{a'}^{(i)}\rangle = a'|{a'}^{(i)}\rangle \quad\text{for } i = 1,2,\ldots,n, \tag{1.4.33}$$

where $|{a'}^{(i)}\rangle$ are n mutually orthonormal eigenkets of A, all with the same eigenvalue
$a'$. To see this, all we need to do is construct appropriate linear combinations
of $|{a'}^{(i)}\rangle$ that diagonalize the B operator by following the diagonalization procedure
to be discussed in Section 1.5.

A simultaneous eigenket of A and B, denoted by $|a',b'\rangle$, has the property

$$A|a',b'\rangle = a'|a',b'\rangle, \tag{1.4.34a}$$

$$B|a',b'\rangle = b'|a',b'\rangle. \tag{1.4.34b}$$

When there is no degeneracy, this notation is somewhat superfluous because it is
clear from (1.4.32) that if we specify $a'$, we necessarily know the $b'$ that appears in
$|a',b'\rangle$. The notation $|a',b'\rangle$ is much more powerful when there are degeneracies.
A simple example may be used to illustrate this point.

Even though a complete discussion of orbital angular momentum will not appear
in this book until Chapter 3, the reader may be aware from his or her earlier
training in elementary wave mechanics that the eigenvalues of $\mathbf{L}^2$ (orbital angular
momentum squared) and $L_z$ (the z-component of orbital angular momentum)
are $\hbar^2 l(l+1)$ and $m_l\hbar$, respectively, with $l$ an integer and $m_l = -l, -l+1, \ldots, +l$.
To characterize an orbital angular-momentum state completely, it is necessary
to specify both $l$ and $m_l$. For example, if we just say $l = 1$, the $m_l$ value can
still be 0, +1, or −1; if we just say $m_l = 1$, $l$ can be 1, 2, 3, 4, and so on. Only
by specifying both $l$ and $m_l$ do we succeed in uniquely characterizing the orbital
angular-momentum state in question. Quite often a collective index $K'$ is used to
stand for $(a',b')$, so that

$$|K'\rangle = |a',b'\rangle. \tag{1.4.35}$$

We can obviously generalize our considerations to a situation where there are 
several (more than two) mutually compatible observables, namely, 

$$[A,B] = [B,C] = [A,C] = \cdots = 0. \tag{1.4.36}$$

Assume that we have found a maximal set of commuting observables; that is,
we cannot add any more observables to our list without violating (1.4.36). The
eigenvalues of individual operators A, B, C, ... may have degeneracies, but if we
specify a combination $(a',b',c',\ldots)$, then the corresponding simultaneous eigenket
of A, B, C, ... is uniquely specified. We can again use a collective index $K'$ to
stand for $(a',b',c',\ldots)$. The orthonormality relation for

$$|K'\rangle = |a',b',c',\ldots\rangle \tag{1.4.37}$$

reads

$$\langle K''|K'\rangle = \delta_{K'K''} = \delta_{a'a''}\delta_{b'b''}\delta_{c'c''}\cdots, \tag{1.4.38}$$

and the completeness relation, or closure, can be written as

$$\sum_{K'} |K'\rangle\langle K'| = \sum_{a'}\sum_{b'}\sum_{c'}\cdots |a',b',c',\ldots\rangle\langle a',b',c',\ldots| = 1. \tag{1.4.39}$$

We now consider measurements of A and B when they are compatible observables.
Suppose we measure A first and obtain result $a'$. Subsequently, we may
measure B and get result $b'$. Finally we measure A again. It follows from our measurement
formalism that the third measurement always gives $a'$ with certainty;
that is, the second (B) measurement does not destroy the previous information
obtained in the first (A) measurement. This is rather obvious when the eigenvalues
of A are nondegenerate:

$$|\alpha\rangle \xrightarrow{A\ \text{measurement}} |a',b'\rangle \xrightarrow{B\ \text{measurement}} |a',b'\rangle \xrightarrow{A\ \text{measurement}} |a',b'\rangle. \tag{1.4.40}$$

When there is degeneracy, the argument goes as follows: After the first (A)
measurement, which yields $a'$, the system is thrown into some linear combination

$$\sum_{i}^{n} c_{a'}^{(i)}|a',b^{(i)}\rangle, \tag{1.4.41}$$

where n is the degree of degeneracy and the kets $|a',b^{(i)}\rangle$ all have the same eigenvalue
$a'$ as far as operator A is concerned. The second (B) measurement may select
just one of the terms in the linear combination (1.4.41), say $|a',b^{(j)}\rangle$, but the
third (A) measurement applied to it still yields $a'$. Whether or not there is degeneracy,
A measurements and B measurements do not interfere. The term compatible
is indeed deemed appropriate.
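
The degenerate case can be illustrated with a toy three-dimensional ket space. The sketch below assumes two commuting observables that are already diagonal in their common eigenbasis, with the eigenvalue $a' = 1$ of A twofold degenerate, and follows a state through the sequence of A, B, A selective measurements. The eigenvalues, the initial state, and the helper `select` (which works only for diagonal operators) are illustrative assumptions.

```python
import math

def apply(op, ket):
    """Operator (matrix) acting on a ket (column of amplitudes)."""
    return [sum(row[j] * ket[j] for j in range(len(ket))) for row in op]

def matmul(a, b):
    """Product of two square matrices of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def normalize(ket):
    """Rescale a ket to unit norm."""
    norm = math.sqrt(sum(abs(x) ** 2 for x in ket))
    return [x / norm for x in ket]

def select(op, value, ket):
    """Selective measurement for a diagonal operator: keep only the
    components whose eigenvalue equals `value`, then renormalize."""
    return normalize([x if op[i][i] == value else 0
                      for i, x in enumerate(ket)])

# Assumed compatible observables in their common eigenbasis. The base kets
# are |a',b'> = |1,5>, |1,6>, |2,5>; the eigenvalue a' = 1 is degenerate.
A = [[1, 0, 0], [0, 1, 0], [0, 0, 2]]
B = [[5, 0, 0], [0, 6, 0], [0, 0, 5]]

alpha = normalize([1, 2j, 2])      # arbitrary initial state ket

after_A = select(A, 1, alpha)      # combination of |1,5> and |1,6>, cf. (1.4.41)
after_B = select(B, 6, after_A)    # the B measurement picks out |1,6>
prob_a1_again = sum(abs(x) ** 2
                    for i, x in enumerate(after_B) if A[i][i] == 1)

print(matmul(A, B) == matmul(B, A))   # the observables commute
print(apply(A, after_B) == after_B)   # still an A eigenket with a' = 1
print(prob_a1_again)                  # probability of a' = 1 in the third measurement
```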

Incompatible Observables

We now turn to incompatible observables, which are more nontrivial. The first
point to be emphasized is that incompatible observables do not have a complete
set of simultaneous eigenkets. To show this, let us assume the converse to be true.
There would then exist a set of simultaneous eigenkets with property (1.4.34a)
and (1.4.34b). Clearly,

$$ AB|a', b'\rangle = Ab'|a', b'\rangle = a'b'|a', b'\rangle. \tag{1.4.42} $$




Likewise,

$$ BA|a', b'\rangle = Ba'|a', b'\rangle = a'b'|a', b'\rangle; \tag{1.4.43} $$

hence,

$$ AB|a', b'\rangle = BA|a', b'\rangle, \tag{1.4.44} $$

and thus $[A, B] = 0$, in contradiction to the assumption. So, in general, $|a', b'\rangle$
does not make sense for incompatible observables. There is, however, an
interesting exception; it may happen that there exists a subspace of the ket space such
that (1.4.44) holds for all elements of this subspace, even though A and B are
incompatible. An example from the theory of orbital angular momentum may be helpful
here. Suppose we consider an $l = 0$ state (s-state). Even though $L_x$ and $L_z$ do not
commute, this state is a simultaneous eigenstate of $L_x$ and $L_z$ (with eigenvalue
zero for both operators). The subspace in this case is one-dimensional.

FIGURE 1.8 Sequential selective measurements, panels (a) and (b).

We already encountered some of the peculiarities associated with incompatible
observables when we discussed sequential Stern-Gerlach experiments in
Section 1.1. We now give a more general discussion of experiments of that type.
Consider the sequence of selective measurements shown in Figure 1.8a. The first
(A) filter selects some particular $|a'\rangle$ and rejects all others, the second (B) filter
selects some particular $|b'\rangle$ and rejects all others, and the third (C) filter selects
some particular $|c'\rangle$ and rejects all others. We are interested in the probability of
obtaining $|c'\rangle$ when the beam coming out of the first filter is normalized to unity.
Because the probabilities are multiplicative, we obviously have

$$ |\langle c'|b'\rangle|^2\, |\langle b'|a'\rangle|^2. \tag{1.4.45} $$

Now let us sum over $b'$ to consider the total probability for going through all
possible $b'$ routes. Operationally this means that we first record the probability of
obtaining $c'$ with all but the first $b'$ route blocked; then we repeat the procedure
with all but the second $b'$ blocked, and so on; then we sum the probabilities at the
end and obtain

$$ \sum_{b'} |\langle c'|b'\rangle|^2\, |\langle b'|a'\rangle|^2 = \sum_{b'} \langle c'|b'\rangle\langle b'|a'\rangle\langle a'|b'\rangle\langle b'|c'\rangle. \tag{1.4.46} $$

We now compare this with a different arrangement, where the B filter is absent
(or not operative); see Figure 1.8b. Clearly, the probability is just $|\langle c'|a'\rangle|^2$, which
can also be written as follows:

$$ |\langle c'|a'\rangle|^2 = \Big|\sum_{b'} \langle c'|b'\rangle\langle b'|a'\rangle\Big|^2 = \sum_{b'}\sum_{b''} \langle c'|b'\rangle\langle b'|a'\rangle\langle a'|b''\rangle\langle b''|c'\rangle. \tag{1.4.47} $$



Notice that expressions (1.4.46) and (1.4.47) are different! This is remarkable
because in both cases the pure $|a'\rangle$ beam coming out of the first (A) filter can be
regarded as being made up of the B eigenkets

$$ |a'\rangle = \sum_{b'} |b'\rangle\langle b'|a'\rangle, \tag{1.4.48} $$

where the sum is over all possible values of $b'$. The crucial point to be noted is that
the result coming out of the C filter depends on whether or not B measurements
have actually been carried out. In the first case, we experimentally ascertain which
of the B eigenvalues are actually realized; in the second case, we merely imagine
$|a'\rangle$ to be built up of the various $|b'\rangle$'s in the sense of (1.4.48). Put in another way,
actually recording the probabilities of going through the various $b'$ routes makes
all the difference even though we sum over $b'$ afterwards. Here lies the heart of
quantum mechanics.

Under what conditions do the two expressions become equal? It is left as an
exercise for the reader to show that for this to happen, in the absence of degeneracy,
it is sufficient that

$$ [A, B] = 0 \quad \text{or} \quad [B, C] = 0. \tag{1.4.49} $$

In other words, the peculiarity we have illustrated is characteristic of incompatible
observables.
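
The difference between (1.4.46) and (1.4.47) can be checked numerically. The sketch below is only an illustration and is not taken from the text: it assumes a spin 1/2 system in units where $\hbar = 1$, chooses $A = C = S_z$ and $B = S_x$, and uses NumPy; the function names are hypothetical.

```python
import numpy as np

# Spin-1/2 operators in the S_z basis, in units where hbar = 1 (an assumption).
SX = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)
SZ = 0.5 * np.array([[1, 0], [0, -1]], dtype=complex)


def eigenkets(observable):
    """Return the orthonormal eigenkets of a Hermitian matrix as a list."""
    _, vectors = np.linalg.eigh(observable)
    return [vectors[:, i] for i in range(vectors.shape[1])]


def prob_b_recorded(a_ket, c_ket, b_kets):
    """Left-hand side of (1.4.46): sum of the individual route probabilities."""
    return sum(
        abs(np.vdot(c_ket, b)) ** 2 * abs(np.vdot(b, a_ket)) ** 2 for b in b_kets
    )


def prob_b_absent(a_ket, c_ket):
    """Equation (1.4.47): probability with the B filter removed."""
    return abs(np.vdot(c_ket, a_ket)) ** 2


a_ket = np.array([1, 0], dtype=complex)  # |S_z +> selected by the A filter
c_ket = np.array([1, 0], dtype=complex)  # |S_z +> selected by the C filter

p_recorded = prob_b_recorded(a_ket, c_ket, eigenkets(SX))    # B = S_x
p_absent = prob_b_absent(a_ket, c_ket)                       # no B filter
p_compatible = prob_b_recorded(a_ket, c_ket, eigenkets(SZ))  # B = S_z

print(p_recorded, p_absent, p_compatible)
```

With the incompatible choice $B = S_x$ the summed route probability is 1/2, whereas without the B filter it is 1; with the compatible choice $B = S_z$ the two expressions agree, in line with (1.4.49).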

The Uncertainty Relation 

The last topic to be discussed in this section is the uncertainty relation. Given an
observable A, we define an operator

$$ \Delta A \equiv A - \langle A\rangle, \tag{1.4.50} $$

where the expectation value is to be taken for a certain physical state under
consideration. The expectation value of $(\Delta A)^2$ is known as the dispersion of A. Because
we have

$$ \langle(\Delta A)^2\rangle = \langle(A^2 - 2A\langle A\rangle + \langle A\rangle^2)\rangle = \langle A^2\rangle - \langle A\rangle^2, \tag{1.4.51} $$

the last expression in (1.4.51) may be taken as an alternative definition of dispersion.
Sometimes the terms variance and mean square deviation are used for the same
quantity. Clearly, the dispersion vanishes when the state in question is an
eigenstate of A. Roughly speaking, the dispersion of an observable characterizes
“fuzziness.” For example, for the $S_z+$ state of a spin $\tfrac12$ system, the dispersion of $S_x$ can
be computed to be

$$ \langle S_x^2\rangle - \langle S_x\rangle^2 = \hbar^2/4. \tag{1.4.52} $$

In contrast the dispersion $\langle(\Delta S_z)^2\rangle$ obviously vanishes for the $S_z+$ state. So, for
the $S_z+$ state, $S_z$ is “sharp” (a vanishing dispersion for $S_z$), whereas $S_x$ is fuzzy.
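
As a quick numerical check of (1.4.52), the following sketch evaluates the dispersion directly from the definition (1.4.51). It assumes the standard $2\times 2$ matrices for $S_x$ and $S_z$ in the $S_z$ basis and sets $\hbar = 1$; the helper name `dispersion` is hypothetical.

```python
import numpy as np

HBAR = 1.0  # units with hbar = 1 (an assumption of this sketch)
SX = 0.5 * HBAR * np.array([[0, 1], [1, 0]], dtype=complex)
SZ = 0.5 * HBAR * np.array([[1, 0], [0, -1]], dtype=complex)


def dispersion(observable, ket):
    """Return <A^2> - <A>^2 for a (not necessarily normalized) ket."""
    ket = ket / np.linalg.norm(ket)
    mean = np.vdot(ket, observable @ ket).real
    mean_square = np.vdot(ket, observable @ observable @ ket).real
    return mean_square - mean**2


sz_up = np.array([1, 0], dtype=complex)  # the S_z + state

disp_sx = dispersion(SX, sz_up)  # fuzzy: hbar^2 / 4
disp_sz = dispersion(SZ, sz_up)  # sharp: 0
print(disp_sx, disp_sz)
```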

We now state the uncertainty relation, which is the generalization of the
well-known x-p uncertainty relation to be discussed in Section 1.6. Let A and B be
observables. Then, for any state, we must have the following inequality:

$$ \langle(\Delta A)^2\rangle\langle(\Delta B)^2\rangle \ge \tfrac14\, |\langle[A, B]\rangle|^2. \tag{1.4.53} $$

To prove this we first state three lemmas.

Lemma 1.1. The Schwarz inequality

$$ \langle\alpha|\alpha\rangle\langle\beta|\beta\rangle \ge |\langle\alpha|\beta\rangle|^2, \tag{1.4.54} $$

which is analogous to

$$ |\mathbf{a}|^2\, |\mathbf{b}|^2 \ge |\mathbf{a}\cdot\mathbf{b}|^2 \tag{1.4.55} $$

in real Euclidean space.

Proof. First note that

$$ (\langle\alpha| + \lambda^*\langle\beta|)\cdot(|\alpha\rangle + \lambda|\beta\rangle) \ge 0, \tag{1.4.56} $$

where $\lambda$ can be any complex number. This inequality must hold when $\lambda$ is set
equal to $-\langle\beta|\alpha\rangle/\langle\beta|\beta\rangle$:

$$ \langle\alpha|\alpha\rangle\langle\beta|\beta\rangle - |\langle\alpha|\beta\rangle|^2 \ge 0, \tag{1.4.57} $$

which is the same as (1.4.54).

Lemma 1.2. The expectation value of a Hermitian operator is purely real.

Proof. The proof is trivial: just use (1.3.21).

Lemma 1.3. The expectation value of an anti-Hermitian operator, defined by
$C = -C^\dagger$, is purely imaginary.

Proof. The proof is trivial.



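Armed with these lemmas, we are in a position to prove the uncertainty relation
(1.4.53). Using Lemma 1.1 with

$$ |\alpha\rangle = \Delta A|\ \rangle, \qquad |\beta\rangle = \Delta B|\ \rangle, \tag{1.4.58} $$

where the blank ket $|\ \rangle$ emphasizes the fact that our consideration may be applied
to any ket, we obtain

$$ \langle(\Delta A)^2\rangle\langle(\Delta B)^2\rangle \ge |\langle\Delta A\,\Delta B\rangle|^2, \tag{1.4.59} $$

where the Hermiticity of $\Delta A$ and $\Delta B$ has been used. To evaluate the right-hand
side of (1.4.59), we note

$$ \Delta A\,\Delta B = \tfrac12[\Delta A, \Delta B] + \tfrac12\{\Delta A, \Delta B\}, \tag{1.4.60} $$

where the commutator $[\Delta A, \Delta B]$, which is equal to $[A, B]$, is clearly
anti-Hermitian:

$$ ([A, B])^\dagger = (AB - BA)^\dagger = BA - AB = -[A, B]. \tag{1.4.61} $$

In contrast, the anticommutator $\{\Delta A, \Delta B\}$ is obviously Hermitian, so

$$ \langle\Delta A\,\Delta B\rangle = \underbrace{\tfrac12\langle[A, B]\rangle}_{\text{purely imaginary}} + \underbrace{\tfrac12\langle\{\Delta A, \Delta B\}\rangle}_{\text{purely real}}, \tag{1.4.62} $$

where Lemmas 1.2 and 1.3 have been used. The right-hand side of (1.4.59) now
becomes

$$ |\langle\Delta A\,\Delta B\rangle|^2 = \tfrac14\, |\langle[A, B]\rangle|^2 + \tfrac14\, |\langle\{\Delta A, \Delta B\}\rangle|^2. \tag{1.4.63} $$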

The proof of (1.4.53) is now complete because the omission of the second (the
anticommutator) term of (1.4.63) can only make the inequality relation stronger.*
Applications of the uncertainty relation to spin $\tfrac12$ systems will be left as
exercises. We come back to this topic when we discuss the fundamental x-p
commutation relation in Section 1.6.
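
The inequality (1.4.53) can also be spot-checked numerically. The sketch below is illustrative only: it assumes a spin 1/2 system with $\hbar = 1$, takes $A = S_x$ and $B = S_y$, and samples random kets with NumPy; all function names are hypothetical.

```python
import numpy as np

# Spin-1/2 operators in the S_z basis with hbar = 1 (an assumption).
SX = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)
SY = 0.5 * np.array([[0, -1j], [1j, 0]], dtype=complex)


def expectation(operator, ket):
    """Return <ket|operator|ket> for a normalized ket."""
    return np.vdot(ket, operator @ ket)


def uncertainty_sides(a_op, b_op, ket):
    """Return (left-hand side, right-hand side) of (1.4.53) for the state."""
    ket = ket / np.linalg.norm(ket)
    identity = np.eye(len(ket))
    delta_a = a_op - expectation(a_op, ket).real * identity
    delta_b = b_op - expectation(b_op, ket).real * identity
    lhs = (expectation(delta_a @ delta_a, ket).real
           * expectation(delta_b @ delta_b, ket).real)
    commutator = a_op @ b_op - b_op @ a_op
    rhs = 0.25 * abs(expectation(commutator, ket)) ** 2
    return lhs, rhs


rng = np.random.default_rng(seed=0)
random_kets = rng.normal(size=(200, 2)) + 1j * rng.normal(size=(200, 2))
all_satisfied = all(
    lhs >= rhs - 1e-12
    for lhs, rhs in (uncertainty_sides(SX, SY, k) for k in random_kets)
)

# For the S_z + state the bound is saturated: both sides equal 1/16.
lhs_up, rhs_up = uncertainty_sides(SX, SY, np.array([1, 0], dtype=complex))
print(all_satisfied, lhs_up, rhs_up)
```

For the $S_z+$ state the anticommutator term of (1.4.63) vanishes, which is why the two sides coincide there.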



*In the literature most authors use $\Delta A$ for our $\sqrt{\langle(\Delta A)^2\rangle}$, so the uncertainty relation is written as
$\Delta A\,\Delta B \ge \tfrac12\, |\langle[A, B]\rangle|$. In this book, however, $\Delta A$ and $\Delta B$ are to be understood as operators [see
(1.4.50)], not numbers.



1.5 ■ CHANGE OF BASIS

Transformation Operator

Suppose we have two incompatible observables A and B. The ket space in question
can be viewed as being spanned either by the set $\{|a'\rangle\}$ or by the set $\{|b'\rangle\}$. For
example, for spin $\tfrac12$ systems $|S_x\pm\rangle$ may be used as our base kets; alternatively,
$|S_z\pm\rangle$ may be used as our base kets. The two different sets of base kets, of course,
span the same ket space. We are interested in finding out how the two descriptions
are related. Changing the set of base kets is referred to as a change of basis or
a change of representation. The basis in which the base eigenkets are given by
$\{|a'\rangle\}$ is called the A representation or, sometimes, the A diagonal representation
because the square matrix corresponding to A is diagonal in this basis.

Our fundamental task is to construct a transformation operator that connects
the old orthonormal set $\{|a'\rangle\}$ and the new orthonormal set $\{|b'\rangle\}$. To this end, we
first show the following.



Theorem 1.3. Given two sets of base kets, both satisfying orthonormality and
completeness, there exists a unitary operator U such that

$$ |b^{(1)}\rangle = U|a^{(1)}\rangle, \quad |b^{(2)}\rangle = U|a^{(2)}\rangle, \quad \ldots, \quad |b^{(N)}\rangle = U|a^{(N)}\rangle. \tag{1.5.1} $$

By a unitary operator we mean an operator fulfilling the conditions

$$ U^\dagger U = 1 \tag{1.5.2} $$

and

$$ UU^\dagger = 1. \tag{1.5.3} $$

Proof. We prove this theorem by explicit construction. We assert that the operator

$$ U = \sum_k |b^{(k)}\rangle\langle a^{(k)}| \tag{1.5.4} $$

will do the job, and we apply this U to $|a^{(l)}\rangle$. Clearly,

$$ U|a^{(l)}\rangle = |b^{(l)}\rangle \tag{1.5.5} $$

is guaranteed by the orthonormality of $\{|a'\rangle\}$. Furthermore, U is unitary:

$$ U^\dagger U = \sum_k\sum_l |a^{(l)}\rangle\langle b^{(l)}|b^{(k)}\rangle\langle a^{(k)}| = \sum_k |a^{(k)}\rangle\langle a^{(k)}| = 1, \tag{1.5.6} $$

where we have used the orthonormality of $\{|b'\rangle\}$ and the completeness of $\{|a'\rangle\}$.
We obtain relation (1.5.3) in an analogous manner.
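
The explicit construction (1.5.4) is easy to try out on a small example. The following sketch is an illustration under stated assumptions, not part of the text: the old basis is taken to be the $S_z$ eigenkets and the new basis the $S_x$ eigenkets of a spin 1/2 system, both written as columns in the $S_z$ basis, and the helper name is hypothetical.

```python
import numpy as np

# Columns are base kets written in the S_z basis (an illustrative choice).
old_basis = np.eye(2, dtype=complex)                                 # |S_z +>, |S_z ->
new_basis = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)  # |S_x +>, |S_x ->


def transformation_operator(old, new):
    """Build U = sum_k |b^(k)><a^(k)| as in (1.5.4)."""
    return sum(
        np.outer(new[:, k], old[:, k].conj()) for k in range(old.shape[1])
    )


U = transformation_operator(old_basis, new_basis)

# Unitarity, (1.5.2) and (1.5.3), and the mapping property (1.5.5).
is_unitary = (np.allclose(U.conj().T @ U, np.eye(2))
              and np.allclose(U @ U.conj().T, np.eye(2)))
maps_kets = all(
    np.allclose(U @ old_basis[:, l], new_basis[:, l]) for l in range(2)
)
print(is_unitary, maps_kets)
```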



Transformation Matrix 

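It is instructive to study the matrix representation of the U operator in the old
$\{|a'\rangle\}$ basis. We have

$$ \langle a^{(k)}|U|a^{(l)}\rangle = \langle a^{(k)}|b^{(l)}\rangle, \tag{1.5.7} $$

which is obvious from (1.5.5). In other words, the matrix elements of the U
operator are built up of the inner products of old basis bras and new basis kets. We
recall that the rotation matrix in three dimensions that changes one set of unit
basis vectors $(\hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}})$ into another set $(\hat{\mathbf{x}}', \hat{\mathbf{y}}', \hat{\mathbf{z}}')$ can be written as
(Goldstein (2002), pp. 134-144, for example)

$$ R = \begin{pmatrix} \hat{\mathbf{x}}\cdot\hat{\mathbf{x}}' & \hat{\mathbf{x}}\cdot\hat{\mathbf{y}}' & \hat{\mathbf{x}}\cdot\hat{\mathbf{z}}' \\ \hat{\mathbf{y}}\cdot\hat{\mathbf{x}}' & \hat{\mathbf{y}}\cdot\hat{\mathbf{y}}' & \hat{\mathbf{y}}\cdot\hat{\mathbf{z}}' \\ \hat{\mathbf{z}}\cdot\hat{\mathbf{x}}' & \hat{\mathbf{z}}\cdot\hat{\mathbf{y}}' & \hat{\mathbf{z}}\cdot\hat{\mathbf{z}}' \end{pmatrix}. \tag{1.5.8} $$

The square matrix made up of $\langle a^{(k)}|U|a^{(l)}\rangle$ is referred to as the transformation
matrix from the $\{|a'\rangle\}$ basis to the $\{|b'\rangle\}$ basis.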

Given an arbitrary ket $|\alpha\rangle$ whose expansion coefficients $\langle a'|\alpha\rangle$ are known in
the old basis,

$$ |\alpha\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle, \tag{1.5.9} $$

how can we obtain $\langle b'|\alpha\rangle$, the expansion coefficients in the new basis? The answer
is very simple: Just multiply (1.5.9) (with $a'$ replaced by $a^{(l)}$ to avoid confusion)
by $\langle b^{(k)}|$:

$$ \langle b^{(k)}|\alpha\rangle = \sum_l \langle b^{(k)}|a^{(l)}\rangle\langle a^{(l)}|\alpha\rangle = \sum_l \langle a^{(k)}|U^\dagger|a^{(l)}\rangle\langle a^{(l)}|\alpha\rangle. \tag{1.5.10} $$

In matrix notation, (1.5.10) states that the column matrix for $|\alpha\rangle$ in the new basis
can be obtained just by applying the square matrix $U^\dagger$ to the column matrix in the
old basis:

$$ (\text{New}) = (U^\dagger)(\text{old}). \tag{1.5.11} $$

The relationships between the old matrix elements and the new matrix elements
are also easy to obtain:

$$ \begin{aligned} \langle b^{(k)}|X|b^{(l)}\rangle &= \sum_m\sum_n \langle b^{(k)}|a^{(m)}\rangle\langle a^{(m)}|X|a^{(n)}\rangle\langle a^{(n)}|b^{(l)}\rangle \\ &= \sum_m\sum_n \langle a^{(k)}|U^\dagger|a^{(m)}\rangle\langle a^{(m)}|X|a^{(n)}\rangle\langle a^{(n)}|U|a^{(l)}\rangle. \end{aligned} \tag{1.5.12} $$

This is simply the well-known formula for a similarity transformation in matrix
algebra,

$$ X' = U^\dagger X U. \tag{1.5.13} $$
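
The rules (1.5.11) and (1.5.13) can be illustrated with a small sketch. It assumes, purely for illustration, a spin 1/2 system with $\hbar = 1$ in which the old basis is the $S_z$ basis and the new basis is the $S_x$ basis; the variable names are hypothetical.

```python
import numpy as np

# U connects the S_z basis (old) to the S_x basis (new); its columns are the
# new base kets expressed in the old basis, cf. (1.5.7).
U = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)

# S_x in the old (S_z) basis, with hbar = 1 (an assumption).
sx_old = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)

# A ket given by its old-basis column: here |S_x +>.
ket_old = np.array([1, 1], dtype=complex) / np.sqrt(2)

ket_new = U.conj().T @ ket_old    # (1.5.11): (New) = (U^dagger)(old)
sx_new = U.conj().T @ sx_old @ U  # (1.5.13): X' = U^dagger X U

print(np.round(ket_new, 12))
print(np.round(sx_new, 12))
```

In the new basis the ket $|S_x+\rangle$ becomes the first unit column and $S_x$ becomes diagonal, as expected for the $S_x$ diagonal representation.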



The trace of an operator X is defined as the sum of diagonal elements:

$$ \mathrm{tr}(X) = \sum_{a'} \langle a'|X|a'\rangle. \tag{1.5.14} $$

Even though a particular set of base kets is used in the definition, tr(X) turns out
to be independent of representation, as shown:

$$ \begin{aligned} \sum_{a'} \langle a'|X|a'\rangle &= \sum_{a'}\sum_{b'}\sum_{b''} \langle a'|b'\rangle\langle b'|X|b''\rangle\langle b''|a'\rangle \\ &= \sum_{b'}\sum_{b''} \langle b''|b'\rangle\langle b'|X|b''\rangle \\ &= \sum_{b'} \langle b'|X|b'\rangle. \end{aligned} \tag{1.5.15} $$

We can also prove

$$ \mathrm{tr}(XY) = \mathrm{tr}(YX), \tag{1.5.16a} $$

$$ \mathrm{tr}(U^\dagger X U) = \mathrm{tr}(X), \tag{1.5.16b} $$

$$ \mathrm{tr}(|a'\rangle\langle a''|) = \delta_{a'a''}, \tag{1.5.16c} $$

$$ \mathrm{tr}(|b'\rangle\langle a'|) = \langle a'|b'\rangle. \tag{1.5.16d} $$

Diagonalization

So far we have not discussed how to find the eigenvalues and eigenkets of an
operator B whose matrix elements in the old $\{|a'\rangle\}$ basis are assumed to be known.
This problem turns out to be equivalent to that of finding the unitary matrix that
diagonalizes B. Even though the reader may already be familiar with the
diagonalization procedure in matrix algebra, it is worth working out this problem using
the Dirac bra-ket notation.

We are interested in obtaining the eigenvalue $b'$ and the eigenket $|b'\rangle$ with the
property

$$ B|b'\rangle = b'|b'\rangle. \tag{1.5.17} $$



First, we rewrite this as

$$ \sum_{a'} \langle a''|B|a'\rangle\langle a'|b'\rangle = b'\langle a''|b'\rangle. \tag{1.5.18} $$

When $|b'\rangle$ in (1.5.17) stands for the $l$th eigenket of operator B, we can write
(1.5.18) in matrix notation as follows:

$$ \begin{pmatrix} B_{11} & B_{12} & B_{13} & \cdots \\ B_{21} & B_{22} & B_{23} & \cdots \\ \vdots & \vdots & \vdots & \ddots \end{pmatrix} \begin{pmatrix} C_1^{(l)} \\ C_2^{(l)} \\ \vdots \end{pmatrix} = b^{(l)} \begin{pmatrix} C_1^{(l)} \\ C_2^{(l)} \\ \vdots \end{pmatrix}, \tag{1.5.19} $$

with

$$ B_{ij} = \langle a^{(i)}|B|a^{(j)}\rangle \tag{1.5.20a} $$

and

$$ C_k^{(l)} = \langle a^{(k)}|b^{(l)}\rangle, \tag{1.5.20b} $$

where i, j, k run up to N, the dimensionality of the ket space. As we know from
linear algebra, nontrivial solutions for $C_k^{(l)}$ are possible only if the characteristic
equation

$$ \det(B - \lambda 1) = 0 \tag{1.5.21} $$

is satisfied. This is an Nth-order algebraic equation for $\lambda$, and the N roots obtained
are to be identified with the various $b^{(l)}$'s we are trying to determine. Knowing
$b^{(l)}$, we can solve for the corresponding $C_k^{(l)}$'s up to an overall constant to be
determined from the normalization condition. Comparing (1.5.20b) with (1.5.7),
we see that the $C_k^{(l)}$'s are just the elements of the unitary matrix involved in the
change of basis $\{|a'\rangle\} \to \{|b'\rangle\}$.

For this procedure the Hermiticity of B is important. For example, consider
$S_+$ defined by (1.3.38) or (1.4.19). This operator is obviously non-Hermitian. The
corresponding matrix, which reads in the $S_z$ basis as

$$ S_+ \doteq \hbar \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \tag{1.5.22} $$

cannot be diagonalized by any unitary matrix. In Chapter 2 we will encounter
eigenkets of a non-Hermitian operator in connection with a coherent state of a
simple harmonic oscillator. Such eigenkets, however, are known not to form a
complete orthonormal set, and the formalism we have developed in this section
cannot be immediately applied.
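
In practice the eigenvalue problem (1.5.19) is usually handed to a numerical routine. The sketch below is an illustration rather than the method of the text: it assumes $\hbar = 1$, uses NumPy's Hermitian eigensolver `numpy.linalg.eigh` on $S_x$ written in the $S_z$ basis, and tests $S_+$ for normality (a matrix can be diagonalized by a unitary matrix exactly when it commutes with its own adjoint). The helper name `is_normal` is hypothetical.

```python
import numpy as np

# S_x and S_+ in the S_z basis, with hbar = 1 (an assumption).
SX = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)
S_PLUS = np.array([[0, 1], [0, 0]], dtype=complex)

# eigh solves the Hermitian eigenvalue problem (1.5.19); column l of
# `coefficients` holds C_k^(l) = <a^(k)|b^(l)>, cf. (1.5.20b).
eigenvalues, coefficients = np.linalg.eigh(SX)
diagonalized = coefficients.conj().T @ SX @ coefficients


def is_normal(matrix):
    """True when the matrix commutes with its Hermitian adjoint."""
    return np.allclose(matrix @ matrix.conj().T, matrix.conj().T @ matrix)


print(eigenvalues)
print(np.round(diagonalized, 12))
print(is_normal(SX), is_normal(S_PLUS))
```

The Hermitian matrix $S_x$ passes the normality test and is brought to diagonal form, whereas $S_+$ fails it, consistent with the remark following (1.5.22).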

Unitary Equivalent Observables

We conclude this section by discussing a remarkable theorem on the unitary
transform of an observable.

Theorem 1.4. Consider again two sets of orthonormal basis $\{|a'\rangle\}$ and $\{|b'\rangle\}$
connected by the U operator (1.5.4). Knowing U, we may construct a unitary
transform of A, $UAU^{-1}$; then A and $UAU^{-1}$ are said to be unitary equivalent
observables. The eigenvalue equation for A,

$$ A|a^{(l)}\rangle = a^{(l)}|a^{(l)}\rangle, \tag{1.5.23} $$

clearly implies that

$$ UAU^{-1}U|a^{(l)}\rangle = a^{(l)}U|a^{(l)}\rangle. \tag{1.5.24} $$

But this can be rewritten as

$$ (UAU^{-1})|b^{(l)}\rangle = a^{(l)}|b^{(l)}\rangle. \tag{1.5.25} $$

This deceptively simple result is quite profound. It tells us that the $|b'\rangle$'s are
eigenkets of $UAU^{-1}$ with exactly the same eigenvalues as the A eigenvalues. In
other words, unitary equivalent observables have identical spectra.

The eigenket $|b^{(l)}\rangle$, by definition, satisfies the relationship

$$ B|b^{(l)}\rangle = b^{(l)}|b^{(l)}\rangle. \tag{1.5.26} $$

Comparing (1.5.25) and (1.5.26), we infer that B and $UAU^{-1}$ are simultaneously
diagonalizable. A natural question is, Is $UAU^{-1}$ the same as B itself? The answer
quite often is yes in cases of physical interest. Take, for example, $S_x$ and $S_z$. They
are related by a unitary operator, which, as we will discuss in Chapter 3, is actually
the rotation operator around the y-axis by angle $\pi/2$. In this case $S_x$ itself is the
unitary transform of $S_z$. Because we know that $S_x$ and $S_z$ exhibit the same set
of eigenvalues, namely, $+\hbar/2$ and $-\hbar/2$, we see that our theorem holds in this
particular example.
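
The $S_x$, $S_z$ example can be verified with a few lines of NumPy. This is a sketch under stated assumptions: $\hbar = 1$, and the spin 1/2 rotation operator about the y-axis is quoted in closed form, $\exp(-iS_y\phi) = \cos(\phi/2) - 2iS_y\sin(\phi/2)$, which is not derived in this section (the text defers rotations to Chapter 3). The function name is hypothetical.

```python
import numpy as np

# Spin-1/2 operators in the S_z basis, with hbar = 1 (an assumption).
SX = 0.5 * np.array([[0, 1], [1, 0]], dtype=complex)
SY = 0.5 * np.array([[0, -1j], [1j, 0]], dtype=complex)
SZ = 0.5 * np.array([[1, 0], [0, -1]], dtype=complex)


def rotation_about_y(angle):
    """Spin-1/2 rotation exp(-i S_y angle) in closed form (assumed, see lead-in)."""
    return np.cos(angle / 2) * np.eye(2) - 2j * np.sin(angle / 2) * SY


U = rotation_about_y(np.pi / 2)
transformed = U @ SZ @ np.linalg.inv(U)  # U A U^{-1} with A = S_z

same_as_sx = np.allclose(transformed, SX)
same_spectrum = np.allclose(
    np.linalg.eigvalsh(transformed), np.linalg.eigvalsh(SZ)
)
print(same_as_sx, same_spectrum)
```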



1.6 ■ POSITION, MOMENTUM, AND TRANSLATION 



Continuous Spectra 

The observables considered so far have all been assumed to exhibit discrete
eigenvalue spectra. In quantum mechanics, however, there are observables with
continuous eigenvalues. Take, for instance, $p_z$, the z-component of momentum. In
quantum mechanics this is again represented by a Hermitian operator. In contrast
to $S_z$, however, the eigenvalues of $p_z$ (in appropriate units) can assume any real
value between $-\infty$ and $\infty$.

The rigorous mathematics of a vector space spanned by eigenkets that exhibit
a continuous spectrum is rather treacherous. The dimensionality of such a space
is obviously infinite. Fortunately, many of the results we worked out for a
finite-dimensional vector space with discrete eigenvalues can immediately be
generalized. In places where straightforward generalizations do not hold, we indicate
danger signals.

We start with the analogue of eigenvalue equation (1.2.5), which, in the
continuous-spectrum case, is written as

$$ \xi|\xi'\rangle = \xi'|\xi'\rangle, \tag{1.6.1} $$

where $\xi$ is an operator and $\xi'$ is simply a number. The ket $|\xi'\rangle$ is, in other words,
an eigenket of operator $\xi$ with eigenvalue $\xi'$, just as $|a'\rangle$ is an eigenket of operator
A with eigenvalue $a'$.

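In pursuing this analogy we replace the Kronecker symbol by Dirac's
$\delta$-function and the discrete sum over the eigenvalues $\{a'\}$ by an integral over the
continuous variable $\xi'$, so

$$ \langle a'|a''\rangle = \delta_{a'a''} \;\to\; \langle\xi'|\xi''\rangle = \delta(\xi' - \xi''), \tag{1.6.2a} $$

$$ \sum_{a'} |a'\rangle\langle a'| = 1 \;\to\; \int d\xi'\, |\xi'\rangle\langle\xi'| = 1, \tag{1.6.2b} $$

$$ |\alpha\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle \;\to\; |\alpha\rangle = \int d\xi'\, |\xi'\rangle\langle\xi'|\alpha\rangle, \tag{1.6.2c} $$

$$ \sum_{a'} |\langle a'|\alpha\rangle|^2 = 1 \;\to\; \int d\xi'\, |\langle\xi'|\alpha\rangle|^2 = 1, \tag{1.6.2d} $$

$$ \langle\beta|\alpha\rangle = \sum_{a'} \langle\beta|a'\rangle\langle a'|\alpha\rangle \;\to\; \langle\beta|\alpha\rangle = \int d\xi'\, \langle\beta|\xi'\rangle\langle\xi'|\alpha\rangle, \tag{1.6.2e} $$

$$ \langle a''|A|a'\rangle = a'\delta_{a'a''} \;\to\; \langle\xi''|\xi|\xi'\rangle = \xi'\delta(\xi'' - \xi'). \tag{1.6.2f} $$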


Notice in particular how the completeness relation (1.6.2b) is used to obtain 
(1.6.2c) and (1.6.2e). 

Position Eigenkets and Position Measurements 

In Section 1.4 we emphasized that a measurement in quantum mechanics is es- 
sentially a filtering process. To extend this idea to measurements of observables 
exhibiting continuous spectra, it is best to work with a specific example. To this 
end we consider the position (or coordinate) operator in one dimension. 

The eigenkets $|x'\rangle$ of the position operator x satisfying

$$ x|x'\rangle = x'|x'\rangle \tag{1.6.3} $$

are postulated to form a complete set. Here $x'$ is just a number with the dimension
of length (0.23 cm, for example), whereas x is an operator. The state ket for an
arbitrary physical state can be expanded in terms of $\{|x'\rangle\}$:

$$ |\alpha\rangle = \int_{-\infty}^{\infty} dx'\, |x'\rangle\langle x'|\alpha\rangle. \tag{1.6.4} $$

We now consider a highly idealized selective measurement of the position
observable. Suppose we place a very tiny detector that clicks only when the particle
is precisely at $x'$ and nowhere else. Immediately after the detector clicks, we can
say that the state in question is represented by $|x'\rangle$. In other words, when the
detector clicks, $|\alpha\rangle$ abruptly “jumps into” $|x'\rangle$ in much the same way as an arbitrary
spin state jumps into the $S_z+$ (or $S_z-$) state when subjected to an SG apparatus
of the $S_z$ type.

In practice, the best the detector can do is to locate the particle within a narrow
interval around $x'$. A realistic detector clicks when a particle is observed to be
located within some narrow range $(x' - \Delta/2,\ x' + \Delta/2)$. When a count is registered
in such a detector, the state ket changes abruptly as follows:

$$ |\alpha\rangle = \int_{-\infty}^{\infty} dx''\, |x''\rangle\langle x''|\alpha\rangle \;\xrightarrow{\text{measurement}}\; \int_{x'-\Delta/2}^{x'+\Delta/2} dx''\, |x''\rangle\langle x''|\alpha\rangle. \tag{1.6.5} $$

Assuming that $\langle x''|\alpha\rangle$ does not change appreciably within the narrow interval, the
probability for the detector to click is given by

$$ |\langle x'|\alpha\rangle|^2\, dx', \tag{1.6.6} $$

where we have written $dx'$ for $\Delta$. This is analogous to $|\langle a'|\alpha\rangle|^2$ for the probability
for $|\alpha\rangle$ to be thrown into $|a'\rangle$ when A is measured. The probability of recording
the particle somewhere between $-\infty$ and $\infty$ is given by

$$ \int_{-\infty}^{\infty} dx'\, |\langle x'|\alpha\rangle|^2, \tag{1.6.7} $$

which is normalized to unity if $|\alpha\rangle$ is normalized:

$$ \langle\alpha|\alpha\rangle = 1 \;\Rightarrow\; \int_{-\infty}^{\infty} dx'\, \langle\alpha|x'\rangle\langle x'|\alpha\rangle = 1. \tag{1.6.8} $$

The reader familiar with wave mechanics may have recognized by this time
that $\langle x'|\alpha\rangle$ is the wave function for the physical state represented by $|\alpha\rangle$. We
will say more about this identification of the expansion coefficient with the
x-representation of the wave function in Section 1.7.
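
To make (1.6.6) and (1.6.7) concrete, the sketch below compares the approximate click probability $|\langle x'|\alpha\rangle|^2\,\Delta$ with the exact integral over the detector window. The normalized Gaussian used for $\langle x'|\alpha\rangle$, the window parameters, and all names are assumptions chosen only for illustration.

```python
import math

import numpy as np


def wave_function(x):
    """Hypothetical example <x'|alpha>: a normalized Gaussian (an assumption)."""
    return math.pi ** -0.25 * np.exp(-np.asarray(x, dtype=float) ** 2 / 2)


# Total probability (1.6.7), approximated by a Riemann sum on a fine grid.
grid = np.linspace(-10.0, 10.0, 20001)
spacing = grid[1] - grid[0]
total_probability = np.sum(np.abs(wave_function(grid)) ** 2) * spacing

# Click probability for a detector window of width delta centred on x0.
x0, delta = 0.5, 0.05
# For this Gaussian, |<x'|alpha>|^2 has the closed-form antiderivative erf(x)/2.
exact_click = 0.5 * (math.erf(x0 + delta / 2) - math.erf(x0 - delta / 2))
approx_click = float(np.abs(wave_function(x0)) ** 2 * delta)  # (1.6.6)

print(total_probability, exact_click, approx_click)
```

The two click probabilities agree closely because the wave function changes little across the narrow window, which is the assumption stated just before (1.6.6).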

The notion of a position eigenket can be extended to three dimensions. It is
assumed in nonrelativistic quantum mechanics that the position eigenkets $|\mathbf{x}'\rangle$ are
complete. The state ket for a particle with internal degrees of freedom, such as
spin, ignored can therefore be expanded in terms of $\{|\mathbf{x}'\rangle\}$ as follows:

$$ |\alpha\rangle = \int d^3x'\, |\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha\rangle, \tag{1.6.9} $$

where $\mathbf{x}'$ stands for $x'$, $y'$, and $z'$; in other words, $|\mathbf{x}'\rangle$ is a simultaneous eigenket
of the observables x, y, and z in the sense of Section 1.4:

$$ |\mathbf{x}'\rangle \equiv |x', y', z'\rangle, \tag{1.6.10a} $$

$$ x|\mathbf{x}'\rangle = x'|\mathbf{x}'\rangle, \quad y|\mathbf{x}'\rangle = y'|\mathbf{x}'\rangle, \quad z|\mathbf{x}'\rangle = z'|\mathbf{x}'\rangle. \tag{1.6.10b} $$

To be able to consider such a simultaneous eigenket at all, we are implicitly
assuming that the three components of the position vector can be measured
simultaneously to arbitrary degrees of accuracy; hence, we must have

$$ [x_i, x_j] = 0, \tag{1.6.11} $$

where $x_1$, $x_2$, and $x_3$ stand for x, y, and z, respectively.

Translation 

We now introduce the very important concept of translation, or spatial
displacement. Suppose we start with a state that is well localized around $\mathbf{x}'$. Let us
consider an operation that changes this state into another well-localized state, this
time around $\mathbf{x}' + d\mathbf{x}'$, with everything else (for example, the spin direction)
unchanged. Such an operation is defined to be an infinitesimal translation by $d\mathbf{x}'$,
and the operator that does the job is denoted by $\mathcal{J}(d\mathbf{x}')$:

$$ \mathcal{J}(d\mathbf{x}')|\mathbf{x}'\rangle = |\mathbf{x}' + d\mathbf{x}'\rangle, \tag{1.6.12} $$

where a possible arbitrary phase factor is set to unity by convention. Notice that
the right-hand side of (1.6.12) is again a position eigenket, but this time with
eigenvalue $\mathbf{x}' + d\mathbf{x}'$. Obviously $|\mathbf{x}'\rangle$ is not an eigenket of the infinitesimal
translation operator.

By expanding an arbitrary state ket $|\alpha\rangle$ in terms of the position eigenkets, we
can examine the effect of infinitesimal translation on $|\alpha\rangle$:

$$ |\alpha\rangle \to \mathcal{J}(d\mathbf{x}')|\alpha\rangle = \mathcal{J}(d\mathbf{x}')\int d^3x'\, |\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha\rangle = \int d^3x'\, |\mathbf{x}' + d\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha\rangle. \tag{1.6.13} $$

We also write the right-hand side of (1.6.13) as

$$ \int d^3x'\, |\mathbf{x}' + d\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha\rangle = \int d^3x'\, |\mathbf{x}'\rangle\langle\mathbf{x}' - d\mathbf{x}'|\alpha\rangle \tag{1.6.14} $$

because the integration is over all space and $\mathbf{x}'$ is just an integration variable.
This shows that the wave function of the translated state $\mathcal{J}(d\mathbf{x}')|\alpha\rangle$ is obtained by
substituting $\mathbf{x}' - d\mathbf{x}'$ for $\mathbf{x}'$ in $\langle\mathbf{x}'|\alpha\rangle$.

There is an equivalent approach to translation that is often treated in the
literature. Instead of considering an infinitesimal translation of the physical system
itself, we consider a change in the coordinate system being used such that the
origin is shifted in the opposite direction, $-d\mathbf{x}'$. Physically, in this alternative
approach we are asking how the same state ket would look to another observer
whose coordinate system is shifted by $-d\mathbf{x}'$. In this book we try not to use this
approach. Obviously it is important that we do not mix the two approaches!

We now list the properties of the infinitesimal translation operator $\mathcal{J}(d\mathbf{x}')$.
The first property we demand is the unitarity property imposed by probability
conservation. It is reasonable to require that if the ket $|\alpha\rangle$ is normalized to unity,
the translated ket $\mathcal{J}(d\mathbf{x}')|\alpha\rangle$ also be normalized to unity, so

$$ \langle\alpha|\alpha\rangle = \langle\alpha|\mathcal{J}^\dagger(d\mathbf{x}')\mathcal{J}(d\mathbf{x}')|\alpha\rangle. \tag{1.6.15} $$

This condition is guaranteed by demanding that the infinitesimal translation be
unitary:

$$ \mathcal{J}^\dagger(d\mathbf{x}')\mathcal{J}(d\mathbf{x}') = 1. \tag{1.6.16} $$

Quite generally, the norm of a ket is preserved under unitary transformations.
For the second property, suppose we consider two successive infinitesimal
translations, first by $d\mathbf{x}'$ and subsequently by $d\mathbf{x}''$, where $d\mathbf{x}'$ and $d\mathbf{x}''$ need
not be in the same direction. We expect the net result to be just a single translation
operation by the vector sum $d\mathbf{x}' + d\mathbf{x}''$, so we demand that

$$ \mathcal{J}(d\mathbf{x}'')\mathcal{J}(d\mathbf{x}') = \mathcal{J}(d\mathbf{x}' + d\mathbf{x}''). \tag{1.6.17} $$

For the third property, suppose we consider a translation in the opposite direction;
we expect the opposite-direction translation to be the same as the inverse of the
original translation:

$$ \mathcal{J}(-d\mathbf{x}') = \mathcal{J}^{-1}(d\mathbf{x}'). \tag{1.6.18} $$

For the fourth property, we demand that as $d\mathbf{x}' \to 0$, the translation operation
reduce to the identity operation

$$ \lim_{d\mathbf{x}' \to 0} \mathcal{J}(d\mathbf{x}') = 1 \tag{1.6.19} $$

and that the difference between $\mathcal{J}(d\mathbf{x}')$ and the identity operator be of first order
in $d\mathbf{x}'$.

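We now demonstrate that if we take the infinitesimal translation operator to be

$$ \mathcal{J}(d\mathbf{x}') = 1 - i\mathbf{K}\cdot d\mathbf{x}', \tag{1.6.20} $$

where the components of K, $K_x$, $K_y$, and $K_z$, are Hermitian operators, then
all the properties listed are satisfied. The first property, the unitarity of $\mathcal{J}(d\mathbf{x}')$, is
checked as follows:

$$ \begin{aligned} \mathcal{J}^\dagger(d\mathbf{x}')\mathcal{J}(d\mathbf{x}') &= (1 + i\mathbf{K}^\dagger\cdot d\mathbf{x}')(1 - i\mathbf{K}\cdot d\mathbf{x}') \\ &= 1 - i(\mathbf{K} - \mathbf{K}^\dagger)\cdot d\mathbf{x}' + O[(d\mathbf{x}')^2] \\ &\simeq 1, \end{aligned} \tag{1.6.21} $$

where terms of second order in $d\mathbf{x}'$ have been ignored for an infinitesimal
translation. The second property (1.6.17) can also be proved as follows:

$$ \begin{aligned} \mathcal{J}(d\mathbf{x}'')\mathcal{J}(d\mathbf{x}') &= (1 - i\mathbf{K}\cdot d\mathbf{x}'')(1 - i\mathbf{K}\cdot d\mathbf{x}') \\ &\simeq 1 - i\mathbf{K}\cdot(d\mathbf{x}' + d\mathbf{x}'') \\ &= \mathcal{J}(d\mathbf{x}' + d\mathbf{x}''). \end{aligned} \tag{1.6.22} $$

The third and fourth properties are obviously satisfied by (1.6.20).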
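
The statement that (1.6.20) satisfies unitarity and composition only up to second-order terms can be made quantitative in a toy model. In the sketch below a random $4\times 4$ Hermitian matrix stands in for a single component of K; this finite-dimensional stand-in and all names are assumptions made for illustration, not the actual operator.

```python
import numpy as np

# A random 4x4 Hermitian matrix stands in for one component of K
# (a finite-dimensional toy model, not the actual operator).
rng = np.random.default_rng(seed=1)
raw = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
K = (raw + raw.conj().T) / 2
IDENTITY = np.eye(4)


def translation(dx):
    """Infinitesimal translation operator 1 - i K dx, as in (1.6.20)."""
    return IDENTITY - 1j * K * dx


def unitarity_defect(dx):
    """Norm of J^dagger J - 1, which should be of second order in dx."""
    j = translation(dx)
    return np.linalg.norm(j.conj().T @ j - IDENTITY)


def composition_defect(dx1, dx2):
    """Norm of J(dx2) J(dx1) - J(dx1 + dx2), also of second order."""
    return np.linalg.norm(
        translation(dx2) @ translation(dx1) - translation(dx1 + dx2)
    )


# Halving dx should reduce the unitarity defect by a factor of four.
ratio = unitarity_defect(1e-3) / unitarity_defect(5e-4)
print(ratio, unitarity_defect(1e-3), composition_defect(1e-3, 2e-3))
```

The factor-of-four scaling reflects the $O[(d\mathbf{x}')^2]$ term dropped in (1.6.21).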

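Accepting (1.6.20) to be the correct form for $\mathcal{J}(d\mathbf{x}')$, we are in a position to
derive an extremely fundamental relation between the K operator and the x operator.
First, note that

$$ \mathbf{x}\mathcal{J}(d\mathbf{x}')|\mathbf{x}'\rangle = \mathbf{x}|\mathbf{x}' + d\mathbf{x}'\rangle = (\mathbf{x}' + d\mathbf{x}')|\mathbf{x}' + d\mathbf{x}'\rangle \tag{1.6.23a} $$

and

$$ \mathcal{J}(d\mathbf{x}')\mathbf{x}|\mathbf{x}'\rangle = \mathbf{x}'\mathcal{J}(d\mathbf{x}')|\mathbf{x}'\rangle = \mathbf{x}'|\mathbf{x}' + d\mathbf{x}'\rangle; \tag{1.6.23b} $$

hence,

$$ [\mathbf{x}, \mathcal{J}(d\mathbf{x}')]\,|\mathbf{x}'\rangle = d\mathbf{x}'|\mathbf{x}' + d\mathbf{x}'\rangle \simeq d\mathbf{x}'|\mathbf{x}'\rangle, \tag{1.6.24} $$

where the error made in approximating the last step of (1.6.24) is of second
order in $d\mathbf{x}'$. Now $|\mathbf{x}'\rangle$ can be any position eigenket, and the position eigenkets are
known to form a complete set. We must therefore have an operator identity

$$ [\mathbf{x}, \mathcal{J}(d\mathbf{x}')] = d\mathbf{x}', \tag{1.6.25} $$

or

$$ -i\mathbf{x}\,\mathbf{K}\cdot d\mathbf{x}' + i\mathbf{K}\cdot d\mathbf{x}'\,\mathbf{x} = d\mathbf{x}', \tag{1.6.26} $$

where, on the right-hand sides of (1.6.25) and (1.6.26), $d\mathbf{x}'$ is understood to be the
number $d\mathbf{x}'$ multiplied by the identity operator in the ket space spanned by $|\mathbf{x}'\rangle$.
By choosing $d\mathbf{x}'$ in the direction of $\hat{\mathbf{x}}_j$ and forming the scalar product with $\hat{\mathbf{x}}_i$, we
obtain

$$ [x_i, K_j] = i\delta_{ij}, \tag{1.6.27} $$

where again $\delta_{ij}$ is understood to be multiplied by the identity operator.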

Momentum as a Generator of Translation 

Equation (1.6.27) is the fundamental commutation relation between the position
operators x, y, z and the K operators $K_x$, $K_y$, $K_z$. Remember that so far, the K
operator is defined in terms of the infinitesimal translation operator by (1.6.20).
What is the physical significance we can attach to K?

J. Schwinger, lecturing on quantum mechanics, once remarked, “. . . for
fundamental properties we will borrow only names from classical physics.” In the
present case we would like to borrow from classical mechanics the notion that
momentum is the generator of an infinitesimal translation. An infinitesimal
translation in classical mechanics can be regarded as a canonical transformation,

$$ \mathbf{x}_{\text{new}} \equiv \mathbf{X} = \mathbf{x} + d\mathbf{x}, \qquad \mathbf{p}_{\text{new}} \equiv \mathbf{P} = \mathbf{p}, \tag{1.6.28} $$

obtainable from the generating function (Goldstein 2002, pp. 386 and 403)

$$ F(\mathbf{x}, \mathbf{P}) = \mathbf{x}\cdot\mathbf{P} + \mathbf{p}\cdot d\mathbf{x}, \tag{1.6.29} $$

where p and P refer to the corresponding momenta.

This equation has a striking similarity to the infinitesimal translation operator
(1.6.20) in quantum mechanics, particularly if we recall that $\mathbf{x}\cdot\mathbf{P}$ in (1.6.29) is the
generating function for the identity transformation ($\mathbf{X} = \mathbf{x}$, $\mathbf{P} = \mathbf{p}$). We are
therefore led to speculate that the operator K is in some sense related to the momentum
operator in quantum mechanics.

Can the K operator be identified with the momentum operator itself?
Unfortunately, the dimension is all wrong; the K operator has the dimension of 1/length
because $\mathbf{K}\cdot d\mathbf{x}'$ must be dimensionless. But it appears legitimate to set

$$ \mathbf{K} = \frac{\mathbf{p}}{\text{universal constant with the dimension of action}}. \tag{1.6.30} $$

From the fundamental postulates of quantum mechanics there is no way to
determine the actual numerical value of the universal constant. Rather, this constant is
needed here because, historically, classical physics was developed before
quantum mechanics using units convenient for describing macroscopic quantities: the
circumference of the earth, the mass of 1 cc of water, the duration of a mean solar
day, and so forth. Had microscopic physics been formulated before macroscopic
physics, the physicists would have almost certainly chosen the basic units in such
a way that the universal constant appearing in (1.6.30) would be unity.

An analogy from electrostatics may be helpful here. The interaction energy
between two particles of charge e separated at a distance r is proportional to $e^2/r$;
in unrationalized Gaussian units, the proportionality factor is just 1, but in
rationalized mks units, which may be more convenient for electrical engineers, the
proportionality factor is $1/4\pi\varepsilon_0$. (See Appendix A.)

The universal constant that appears in (1.6.30) turns out to be the same as the
constant $\hbar$ that appears in L. de Broglie's relation, written in 1924,

$$ \frac{2\pi}{\lambda} = \frac{p}{\hbar}, \tag{1.6.31} $$

where $\lambda$ is the wavelength of a “particle wave.” In other words, the K operator is
the quantum-mechanical operator that corresponds to the wave number, that is,
$2\pi$ times the reciprocal wavelength, usually denoted by k. With this identification,
the infinitesimal translation operator $\mathcal{J}(d\mathbf{x}')$ reads

$$ \mathcal{J}(d\mathbf{x}') = 1 - i\mathbf{p}\cdot d\mathbf{x}'/\hbar, \tag{1.6.32} $$

where p is the momentum operator. The commutation relation (1.6.27) now
becomes

$$ [x_i, p_j] = i\hbar\delta_{ij}. \tag{1.6.33} $$

The commutation relations (1.6.33) imply, for example, that x and $p_x$ (but not
x and $p_y$) are incompatible observables. It is therefore impossible to find
simultaneous eigenkets of x and $p_x$. The general formalism of Section 1.4 can be applied
here to obtain the position-momentum uncertainty relation of W. Heisenberg:

$$ \langle(\Delta x)^2\rangle\langle(\Delta p_x)^2\rangle \ge \hbar^2/4. \tag{1.6.34} $$

Some applications of (1.6.34) will appear in Section 1.7.
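
The commutation relation (1.6.33) can be illustrated on a grid. The sketch below rests on an assumption that is not derived in this passage, namely the wave-mechanical form $p = -i\hbar\, d/dx$ acting on position-space wave functions; it also sets $\hbar = 1$, uses central differences, and picks a Gaussian test function. All names are hypothetical.

```python
import numpy as np

HBAR = 1.0  # units with hbar = 1 (an assumption)

x = np.linspace(-5.0, 5.0, 1001)
h = x[1] - x[0]
psi = np.exp(-x**2 / 2).astype(complex)  # hypothetical test function


def apply_p(f):
    """Apply p = -i hbar d/dx with central differences (interior points only).

    The differential form of p in the position basis is taken from wave
    mechanics as an assumption; it is not derived in this passage.
    """
    out = np.zeros_like(f)
    out[1:-1] = -1j * HBAR * (f[2:] - f[:-2]) / (2 * h)
    return out


commutator_on_psi = x * apply_p(psi) - apply_p(x * psi)  # [x, p] psi
expected = 1j * HBAR * psi                               # i hbar psi

# Compare on interior points, where the central difference is defined.
max_error = np.max(np.abs(commutator_on_psi[1:-1] - expected[1:-1]))
print(max_error)
```

The residual is of order $h^2$, the discretization error of the central difference, so the discrete commutator reproduces $i\hbar$ times the test function to that accuracy.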

So far we have concerned ourselves with infinitesimal translations. A finite
translation, that is, a spatial displacement by a finite amount, can be obtained
by successively compounding infinitesimal translations. Let us consider a finite
translation in the x-direction by an amount $\Delta x'$:

$$ \mathcal{J}(\Delta x'\hat{\mathbf{x}})|\mathbf{x}'\rangle = |\mathbf{x}' + \Delta x'\hat{\mathbf{x}}\rangle. \tag{1.6.35} $$

By compounding N infinitesimal translations, each of which is characterized by a
spatial displacement $\Delta x'/N$ in the x-direction, and letting $N \to \infty$, we obtain

$$ \mathcal{J}(\Delta x'\hat{\mathbf{x}}) = \lim_{N\to\infty}\left(1 - \frac{ip_x\Delta x'}{N\hbar}\right)^N = \exp\left(-\frac{ip_x\Delta x'}{\hbar}\right). \tag{1.6.36} $$

Here $\exp(-ip_x\Delta x'/\hbar)$ is understood to be a function of the operator $p_x$; generally,
for any operator X we have

$$ \exp(X) \equiv 1 + X + \frac{X^2}{2!} + \cdots. \tag{1.6.37} $$
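
The limit in (1.6.36) can be watched converging in a toy model. In the sketch below a fixed $2\times 2$ Hermitian matrix G stands in for $p_x\Delta x'/\hbar$ (an assumption made only for illustration), the exponential is evaluated through the eigendecomposition of G, and the compounded product is formed with `numpy.linalg.matrix_power`; the function names are hypothetical.

```python
import numpy as np

# A 2x2 Hermitian matrix stands in for p_x * dx' / hbar (a toy model).
G = np.array([[1.0, 0.5], [0.5, -1.0]], dtype=complex)
IDENTITY = np.eye(2)


def exp_minus_i(hermitian):
    """exp(-i G) from the eigendecomposition of a Hermitian matrix G."""
    values, vectors = np.linalg.eigh(hermitian)
    return vectors @ np.diag(np.exp(-1j * values)) @ vectors.conj().T


def compounded(hermitian, n_steps):
    """(1 - i G / N)^N: N infinitesimal steps compounded, as in (1.6.36)."""
    step = IDENTITY - 1j * hermitian / n_steps
    return np.linalg.matrix_power(step, n_steps)


exact = exp_minus_i(G)
errors = [np.linalg.norm(compounded(G, n) - exact) for n in (10, 100, 1000)]
print(errors)
```

The error shrinks roughly in proportion to 1/N, and the limiting operator is unitary even though each finite-N product is unitary only approximately.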




FIGURE 1.9 Successive translations in different directions (points A, B, C, D; displacements $\Delta x'\hat{\mathbf{x}}$ and $\Delta y'\hat{\mathbf{y}}$).



A fundamental property of translations is that successive translations in dif- 
ferent directions, say in the x- and y-directions, commute. We see this clearly in 
Figure 1.9; in shifting from A and B, it does not matter whether we go via C or 
via D. Mathematically, 

#( A/y)#( Ax'x) = ${Ax'x + A/y), ^ 

#(Ax'x)#(A/y) = #( Ax'x + A/y). ( 



This point is not so trivial as it may appear; we will show in Chapter 3 that ro- 
tations about different axes do not commute. Treating Ax' and A / up to second 
order, we obtain 



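\[
\begin{aligned}
[\mathscr{T}(\Delta y'\hat{\mathbf{y}}),\,\mathscr{T}(\Delta x'\hat{\mathbf{x}})]
 &= \left[\left(1 - \frac{i p_y \Delta y'}{\hbar} - \frac{p_y^2(\Delta y')^2}{2\hbar^2} + \cdots\right),
          \left(1 - \frac{i p_x \Delta x'}{\hbar} - \frac{p_x^2(\Delta x')^2}{2\hbar^2} + \cdots\right)\right]\\
 &\simeq -\frac{(\Delta x')(\Delta y')\,[p_y, p_x]}{\hbar^2}.
\end{aligned} \tag{1.6.39}
\]

Because $\Delta x'$ and $\Delta y'$ are arbitrary, requirement (1.6.38), or

\[ [\mathscr{T}(\Delta y'\hat{\mathbf{y}}),\,\mathscr{T}(\Delta x'\hat{\mathbf{x}})] = 0, \tag{1.6.40} \]

immediately leads to

\[ [p_x, p_y] = 0, \tag{1.6.41} \]

or, more generally,

\[ [p_i, p_j] = 0. \tag{1.6.42} \]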



This commutation relation is a direct consequence of the fact that translations in
different directions commute. Whenever the generators of transformations com- 
mute, the corresponding group is said to be Abelian. The translation group in 
three dimensions is Abelian. 







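Equation (1.6.42) implies that $p_x$, $p_y$, and $p_z$ are mutually compatible
observables. We can therefore conceive of a simultaneous eigenket of $p_x$, $p_y$, $p_z$,
namely,

\[ |\mathbf{p}'\rangle \equiv |p_x', p_y', p_z'\rangle, \tag{1.6.43a} \]

\[ p_x|\mathbf{p}'\rangle = p_x'|\mathbf{p}'\rangle, \quad p_y|\mathbf{p}'\rangle = p_y'|\mathbf{p}'\rangle, \quad p_z|\mathbf{p}'\rangle = p_z'|\mathbf{p}'\rangle. \tag{1.6.43b} \]

It is instructive to work out the effect of $\mathscr{T}(d\mathbf{x}')$ on such a momentum eigenket:

\[
\mathscr{T}(d\mathbf{x}')\,|\mathbf{p}'\rangle
 = \left(1 - \frac{i\mathbf{p}\cdot d\mathbf{x}'}{\hbar}\right)|\mathbf{p}'\rangle
 = \left(1 - \frac{i\mathbf{p}'\cdot d\mathbf{x}'}{\hbar}\right)|\mathbf{p}'\rangle. \tag{1.6.44}
\]

We see that the momentum eigenket remains the same even though it suffers a
slight phase change, so, unlike $|\mathbf{x}'\rangle$, $|\mathbf{p}'\rangle$ is an eigenket of $\mathscr{T}(d\mathbf{x}')$, which we
anticipated because

\[ [\mathbf{p},\,\mathscr{T}(d\mathbf{x}')] = 0. \tag{1.6.45} \]

Notice, however, that the eigenvalue of $\mathscr{T}(d\mathbf{x}')$ is complex; we do not expect a real
eigenvalue here because $\mathscr{T}(d\mathbf{x}')$, though unitary, is not Hermitian.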

The Canonical Commutation Relations 

We summarize the commutator relations we inferred by studying the properties 
of translation: 



\[ [x_i, x_j] = 0, \quad [p_i, p_j] = 0, \quad [x_i, p_j] = i\hbar\,\delta_{ij}. \tag{1.6.46} \]

These relations form the cornerstone of quantum mechanics; in his book, P. A. 
M. Dirac calls them the “fundamental quantum conditions.” More often they are 
known as the canonical commutation relations or the fundamental commuta- 
tion relations. 

Historically it was W. Heisenberg who, in 1925, showed that the combination 
rule for atomic transition lines known at that time could best be understood if one 
associated arrays of numbers obeying certain multiplication rules with these fre- 
quencies. Immediately afterward, M. Born and P. Jordan pointed out that Heisen- 
berg’s multiplication rules are essentially those of matrix algebra, and a theory 
based on the matrix analogues of (1.6.46) was developed; it is now known as 
matrix mechanics.* 

Also in 1925, P. A. M. Dirac observed that the various quantum-mechanical 
relations can be obtained from the corresponding classical relations just by re- 
placing classical Poisson brackets by commutators, as follows: 

\[ [\ ,\ ]_{\text{classical}} \to \frac{[\ ,\ ]}{i\hbar}, \tag{1.6.47} \]

*Appropriately, $pq - qp = h/2\pi i$ is inscribed on the gravestone of M. Born in Göttingen.







where we may recall that the classical Poisson brackets are defined for functions
of q's and p's as

\[
[A(q,p), B(q,p)]_{\text{classical}} \equiv \sum_s \left(\frac{\partial A}{\partial q_s}\frac{\partial B}{\partial p_s} - \frac{\partial A}{\partial p_s}\frac{\partial B}{\partial q_s}\right). \tag{1.6.48}
\]

For example, in classical mechanics, we have

\[ [x_i, p_j]_{\text{classical}} = \delta_{ij}, \tag{1.6.49} \]

which in quantum mechanics turns into (1.6.33).

Dirac’s rule (1.6.47) is plausible because the classical Poisson brackets and 
quantum-mechanical commutators satisfy similar algebraic properties. In particu- 
lar, the following relations can be proved regardless of whether [ , ] is understood 
as a classical Poisson bracket or as a quantum-mechanical commutator: 



\[ [A, A] = 0 \tag{1.6.50a} \]
\[ [A, B] = -[B, A] \tag{1.6.50b} \]
\[ [A, c] = 0 \quad (c \text{ is just a number}) \tag{1.6.50c} \]
\[ [A + B, C] = [A, C] + [B, C] \tag{1.6.50d} \]
\[ [A, BC] = [A, B]C + B[A, C] \tag{1.6.50e} \]
\[ [A, [B, C]] + [B, [C, A]] + [C, [A, B]] = 0, \tag{1.6.50f} \]



where the last relation is known as the Jacobi identity.* However, there are im- 
portant differences. First, the dimension of the classical Poisson bracket differs 
from that of the quantum-mechanical commutator because of the differentiations 
with respect to q and p appearing in (1.6.48). Second, the Poisson bracket of real 
functions of q’s and p’s is purely real, whereas the commutator of two Hermitian 
operators is anti-Hermitian (see Lemma 3 of Section 1.4). To take care of these 
differences, the factor $i\hbar$ is inserted in (1.6.47).
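For commutators, the product rule (1.6.50e) and the Jacobi identity (1.6.50f) can be spot-checked numerically. The sketch below uses three arbitrarily chosen 2 x 2 matrices; the matrices and the helper names (`matmul`, `commutator`, and so on) are assumptions made for the illustration and do not come from the text. A check on particular matrices is of course not a proof.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def commutator(a, b):
    """[a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))

def max_abs(a):
    """Largest entry in magnitude; zero means the matrix vanishes."""
    return max(abs(v) for row in a for v in row)

# Arbitrary test matrices (not Hermitian; the identities do not require it).
A = [[1, 2j], [0, -1]]
B = [[0, 1], [3, 2]]
C = [[2, -1j], [1j, 5]]

# Jacobi identity (1.6.50f): the cyclic sum of nested commutators.
jacobi = add(add(commutator(A, commutator(B, C)),
                 commutator(B, commutator(C, A))),
             commutator(C, commutator(A, B)))

# Product rule (1.6.50e): [A, BC] - ([A, B]C + B[A, C]).
product_rule = sub(commutator(A, matmul(B, C)),
                   add(matmul(commutator(A, B), C),
                       matmul(B, commutator(A, C))))

print(max_abs(jacobi), max_abs(product_rule))
```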

We have deliberately avoided exploiting Dirac’s analogy in obtaining the 
canonical commutation relations. Our approach to the commutation relations is 
based solely on (1) the properties of translations and (2) the identification of 
the generator of translation with the momentum operator modulo a universal
constant with the dimension of action. We believe that this approach is more 
powerful because it can be generalized to situations where observables have no 
classical analogues. For example, the spin-angular-momentum components we 
encountered in Section 1.4 have nothing to do with the p’s and q’s of classical 
mechanics; yet, as we will show in Chapter 3, the spin-angular-momentum com- 
mutation relations can be derived using the properties of rotations, just as we 
derived the canonical commutation relations using the properties of translations. 

*It is amusing that the Jacobi identity in quantum mechanics is much easier to prove than its 
classical analogue. 







1.7 ■ WAVE FUNCTIONS IN POSITION AND MOMENTUM SPACE 
Position-Space Wave Function 

In this section we present a systematic study of the properties of wave functions 
in both position and momentum space. For simplicity let us return to the one- 
dimensional case. The base kets used are the position kets satisfying 

\[ x|x'\rangle = x'|x'\rangle, \tag{1.7.1} \]

normalized in such a way that the orthogonality condition reads

\[ \langle x''|x'\rangle = \delta(x'' - x'). \tag{1.7.2} \]

We have already remarked that the ket representing a physical state can be
expanded in terms of $|x'\rangle$,

\[ |\alpha\rangle = \int dx'\,|x'\rangle\langle x'|\alpha\rangle, \tag{1.7.3} \]

and that the expansion coefficient $\langle x'|\alpha\rangle$ is interpreted in such a way that

\[ |\langle x'|\alpha\rangle|^2\,dx' \tag{1.7.4} \]

is the probability for the particle to be found in a narrow interval $dx'$ around $x'$. In
our formalism the inner product $\langle x'|\alpha\rangle$ is what is usually referred to as the wave
function $\psi_\alpha(x')$ for state $|\alpha\rangle$:

\[ \langle x'|\alpha\rangle = \psi_\alpha(x'). \tag{1.7.5} \]

In elementary wave mechanics the probabilistic interpretations for the expansion
coefficient $c_{a'}$ ($= \langle a'|\alpha\rangle$) and for the wave function $\psi_\alpha(x')$ ($= \langle x'|\alpha\rangle$) are
often presented as separate postulates. One of the major advantages of our
formalism, which was originated by Dirac, is that the two kinds of probabilistic
interpretations are unified; $\psi_\alpha(x')$ is an expansion coefficient [see (1.7.3)] in much
the same way as $c_{a'}$ is. By following in the footsteps of Dirac, we come to
appreciate the unity of quantum mechanics.

Consider the inner product $\langle\beta|\alpha\rangle$. Using the completeness of $|x'\rangle$, we have

\[
\langle\beta|\alpha\rangle = \int dx'\,\langle\beta|x'\rangle\langle x'|\alpha\rangle
 = \int dx'\,\psi_\beta^*(x')\,\psi_\alpha(x'), \tag{1.7.6}
\]

so $\langle\beta|\alpha\rangle$ characterizes the overlap between the two wave functions. Note that we
are not defining $\langle\beta|\alpha\rangle$ as the overlap integral; the identification of $\langle\beta|\alpha\rangle$ with the
overlap integral follows from our completeness postulate for $|x'\rangle$. The more
general interpretation of $\langle\beta|\alpha\rangle$, independent of representations, is that it represents
the probability amplitude for state $|\alpha\rangle$ to be found in state $|\beta\rangle$.







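This time let us interpret the expansion

\[ |\alpha\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle \tag{1.7.7} \]

using the language of wave functions. We just multiply both sides of (1.7.7) by
the position eigenbra $\langle x'|$ on the left. Thus

\[ \langle x'|\alpha\rangle = \sum_{a'} \langle x'|a'\rangle\langle a'|\alpha\rangle. \tag{1.7.8} \]

In the usual notation of wave mechanics, this is recognized as

\[ \psi_\alpha(x') = \sum_{a'} c_{a'}\,u_{a'}(x'), \]

where we have introduced an eigenfunction of operator A with eigenvalue $a'$:

\[ u_{a'}(x') = \langle x'|a'\rangle. \tag{1.7.9} \]

Let us now examine how $\langle\beta|A|\alpha\rangle$ can be written using the wave functions for
$|\alpha\rangle$ and $|\beta\rangle$. Clearly, we have

\[
\begin{aligned}
\langle\beta|A|\alpha\rangle &= \int dx' \int dx''\,\langle\beta|x'\rangle\langle x'|A|x''\rangle\langle x''|\alpha\rangle\\
 &= \int dx' \int dx''\,\psi_\beta^*(x')\,\langle x'|A|x''\rangle\,\psi_\alpha(x'').
\end{aligned} \tag{1.7.10}
\]

Therefore, to be able to evaluate $\langle\beta|A|\alpha\rangle$, we must know the matrix element
$\langle x'|A|x''\rangle$, which is, in general, a function of the two variables $x'$ and $x''$.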

An enormous simplification takes place if observable A is a function of the
position operator x. In particular, consider

\[ A = x^2, \tag{1.7.11} \]

which actually appears in the Hamiltonian for the simple harmonic oscillator
problem to be discussed in Chapter 2. We have

\[ \langle x'|x^2|x''\rangle = (\langle x'|)\cdot(x''^2|x''\rangle) = x'^2\,\delta(x' - x''), \tag{1.7.12} \]

where we have used (1.7.1) and (1.7.2). The double integral (1.7.10) is now
reduced to a single integral:

\[
\begin{aligned}
\langle\beta|x^2|\alpha\rangle &= \int dx'\,\langle\beta|x'\rangle\,x'^2\,\langle x'|\alpha\rangle\\
 &= \int dx'\,\psi_\beta^*(x')\,x'^2\,\psi_\alpha(x').
\end{aligned} \tag{1.7.13}
\]







In general,

\[ \langle\beta|f(x)|\alpha\rangle = \int dx'\,\psi_\beta^*(x')\,f(x')\,\psi_\alpha(x'). \tag{1.7.14} \]

Note that the $f(x)$ on the left-hand side of (1.7.14) is an operator, whereas the
$f(x')$ on the right-hand side is not an operator.
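The step from the double integral (1.7.10) to the single integral (1.7.14) can be written out explicitly. The sketch below assumes that f can be expanded in a power series, so that $f(x)|x''\rangle = f(x'')|x''\rangle$ follows from (1.7.1); the delta function from (1.7.2) then removes one integration.

```latex
\begin{align*}
\langle x'|f(x)|x''\rangle
  &= f(x'')\,\langle x'|x''\rangle
   = f(x')\,\delta(x'-x''),\\
\langle\beta|f(x)|\alpha\rangle
  &= \int dx' \int dx''\,\psi_\beta^*(x')\,f(x')\,\delta(x'-x'')\,\psi_\alpha(x'')
   = \int dx'\,\psi_\beta^*(x')\,f(x')\,\psi_\alpha(x').
\end{align*}
```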



Momentum Operator in the Position Basis 

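We now examine how the momentum operator may look in the x-basis, that is, in
the representation where the position eigenkets are used as base kets. Our starting
point is the definition of momentum as the generator of infinitesimal translations:

\[
\begin{aligned}
\left(1 - \frac{i p\,\Delta x'}{\hbar}\right)|\alpha\rangle
 &= \int dx'\,\mathscr{T}(\Delta x')\,|x'\rangle\langle x'|\alpha\rangle\\
 &= \int dx'\,|x' + \Delta x'\rangle\langle x'|\alpha\rangle\\
 &= \int dx'\,|x'\rangle\langle x' - \Delta x'|\alpha\rangle\\
 &= \int dx'\,|x'\rangle\left(\langle x'|\alpha\rangle - \Delta x'\,\frac{\partial}{\partial x'}\langle x'|\alpha\rangle\right).
\end{aligned} \tag{1.7.15}
\]

Comparison of both sides yields

\[ p|\alpha\rangle = \int dx'\,|x'\rangle\left(-i\hbar\,\frac{\partial}{\partial x'}\langle x'|\alpha\rangle\right) \tag{1.7.16} \]

or

\[ \langle x'|p|\alpha\rangle = -i\hbar\,\frac{\partial}{\partial x'}\langle x'|\alpha\rangle, \tag{1.7.17} \]

where we have used the orthogonality property (1.7.2). For the matrix element p
in the x-representation, we obtain

\[ \langle x'|p|x''\rangle = -i\hbar\,\frac{\partial}{\partial x'}\,\delta(x' - x''). \tag{1.7.18} \]

From (1.7.16) we get a very important identity:

\[
\begin{aligned}
\langle\beta|p|\alpha\rangle &= \int dx'\,\langle\beta|x'\rangle\left(-i\hbar\,\frac{\partial}{\partial x'}\langle x'|\alpha\rangle\right)\\
 &= \int dx'\,\psi_\beta^*(x')\left(-i\hbar\,\frac{\partial}{\partial x'}\right)\psi_\alpha(x').
\end{aligned} \tag{1.7.19}
\]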



In our formalism, (1.7.19) is not a postulate; rather, it has been derived using 
the basic properties of momentum. By repeatedly applying (1.7.17), we can also 







obtain 

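\[ \langle x'|p^n|\alpha\rangle = (-i\hbar)^n\,\frac{\partial^n}{\partial x'^n}\langle x'|\alpha\rangle, \tag{1.7.20} \]

\[ \langle\beta|p^n|\alpha\rangle = \int dx'\,\psi_\beta^*(x')\,(-i\hbar)^n\,\frac{\partial^n}{\partial x'^n}\,\psi_\alpha(x'). \tag{1.7.21} \]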

Momentum-Space Wave Function 

So far we have worked exclusively in the x-basis. But there is actually a complete 
symmetry between x and p (apart from occasional minus signs) that we can infer 
from the canonical commutation relations. Let us now work in the p-basis — that 
is, in the momentum representation. 

For simplicity we continue working in one-space. The base eigenkets in the
p-basis specify

\[ p|p'\rangle = p'|p'\rangle \tag{1.7.22} \]

and

\[ \langle p'|p''\rangle = \delta(p' - p''). \tag{1.7.23} \]

The momentum eigenkets $\{|p'\rangle\}$ span the ket space in much the same way as the
position eigenkets $\{|x'\rangle\}$. An arbitrary state ket $|\alpha\rangle$ can therefore be expanded as
follows:

\[ |\alpha\rangle = \int dp'\,|p'\rangle\langle p'|\alpha\rangle. \tag{1.7.24} \]

We can give a probabilistic interpretation for the expansion coefficient $\langle p'|\alpha\rangle$; the
probability that a measurement of p gives eigenvalue $p'$ within a narrow interval
$dp'$ is $|\langle p'|\alpha\rangle|^2\,dp'$. It is customary to call $\langle p'|\alpha\rangle$ the momentum-space wave
function; the notation $\phi_\alpha(p')$ is often used:

\[ \langle p'|\alpha\rangle = \phi_\alpha(p'). \tag{1.7.25} \]



If $|\alpha\rangle$ is normalized, we obtain

\[ \int dp'\,\langle\alpha|p'\rangle\langle p'|\alpha\rangle = \int dp'\,|\phi_\alpha(p')|^2 = 1. \tag{1.7.26} \]

Let us now establish the connection between the x-representation and the
p-representation. We recall that in the case of the discrete spectra, the change of
basis from the old set $\{|a'\rangle\}$ to the new set $\{|b'\rangle\}$ is characterized by the
transformation matrix (1.5.7). Likewise, we expect that the desired information is contained
in $\langle x'|p'\rangle$, which is a function of $x'$ and $p'$, usually called the transformation
function from the x-representation to the p-representation. To derive the explicit
form of $\langle x'|p'\rangle$, first recall (1.7.17); letting $|\alpha\rangle$ be the momentum eigenket $|p'\rangle$,
we obtain

\[ \langle x'|p|p'\rangle = -i\hbar\,\frac{\partial}{\partial x'}\langle x'|p'\rangle \tag{1.7.27} \]







or

\[ p'\langle x'|p'\rangle = -i\hbar\,\frac{\partial}{\partial x'}\langle x'|p'\rangle. \tag{1.7.28} \]

The solution to this differential equation for $\langle x'|p'\rangle$ is

\[ \langle x'|p'\rangle = N \exp\left(\frac{i p' x'}{\hbar}\right), \tag{1.7.29} \]

where N is the normalization constant to be determined in a moment. Even though
the transformation function $\langle x'|p'\rangle$ is a function of two variables, $x'$ and $p'$, we
can temporarily regard it as a function of $x'$ with $p'$ fixed. It can then be viewed
as the probability amplitude for the momentum eigenstate specified by $p'$ to be
found at position $x'$; in other words, it is just the wave function for the momentum
eigenstate $|p'\rangle$, often referred to as the momentum eigenfunction (still in the
x-space). So (1.7.29) simply says that the wave function of a momentum eigenstate
is a plane wave. It is amusing that we have obtained this plane-wave solution
without solving the Schrödinger equation (which we have not yet written down).

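To get the normalization constant N, let us first consider

\[ \langle x'|x''\rangle = \int dp'\,\langle x'|p'\rangle\langle p'|x''\rangle. \tag{1.7.30} \]

The left-hand side is just $\delta(x' - x'')$; the right-hand side can be evaluated using
the explicit form of $\langle x'|p'\rangle$:

\[
\begin{aligned}
\delta(x' - x'') &= |N|^2 \int dp'\,\exp\left[\frac{i p'(x' - x'')}{\hbar}\right]\\
 &= 2\pi\hbar\,|N|^2\,\delta(x' - x'').
\end{aligned} \tag{1.7.31}
\]

Choosing N to be purely real and positive by convention, we finally have

\[ \langle x'|p'\rangle = \frac{1}{\sqrt{2\pi\hbar}}\exp\left(\frac{i p' x'}{\hbar}\right). \tag{1.7.32} \]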

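We can now demonstrate how the position-space wave function is related to
the momentum-space wave function. All we have to do is rewrite

\[ \langle x'|\alpha\rangle = \int dp'\,\langle x'|p'\rangle\langle p'|\alpha\rangle \tag{1.7.33a} \]

and

\[ \langle p'|\alpha\rangle = \int dx'\,\langle p'|x'\rangle\langle x'|\alpha\rangle \tag{1.7.33b} \]

as

\[ \psi_\alpha(x') = \left[\frac{1}{\sqrt{2\pi\hbar}}\right]\int dp'\,\exp\left(\frac{i p' x'}{\hbar}\right)\phi_\alpha(p') \tag{1.7.34a} \]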







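and

\[ \phi_\alpha(p') = \left[\frac{1}{\sqrt{2\pi\hbar}}\right]\int dx'\,\exp\left(\frac{-i p' x'}{\hbar}\right)\psi_\alpha(x'). \tag{1.7.34b} \]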



The pair of equations is just what one expects from Fourier’s inversion theo- 
rem. Apparently the mathematics we have developed somehow “knows” Fourier’s 
work on integral transforms. 



Gaussian Wave Packets 

It is instructive to look at a physical example to illustrate our basic formalism.
We consider what is known as a Gaussian wave packet, whose x-space wave
function is given by

\[ \langle x'|\alpha\rangle = \left[\frac{1}{\pi^{1/4}\sqrt{d}}\right]\exp\left[ikx' - \frac{x'^2}{2d^2}\right]. \tag{1.7.35} \]

This is a plane wave with wave number k modulated by a Gaussian profile
centered on the origin. The probability of observing the particle vanishes very rapidly
for $|x'| > d$; more quantitatively, the probability density $|\langle x'|\alpha\rangle|^2$ has a Gaussian
shape with width d.

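We now compute the expectation values of $x$, $x^2$, $p$, and $p^2$. The expectation
value of x is clearly zero by symmetry:

\[ \langle x\rangle = \int_{-\infty}^{\infty} dx'\,\langle\alpha|x'\rangle\,x'\,\langle x'|\alpha\rangle = \int_{-\infty}^{\infty} dx'\,|\langle x'|\alpha\rangle|^2\,x' = 0. \tag{1.7.36} \]

For $x^2$ we obtain

\[
\begin{aligned}
\langle x^2\rangle &= \int_{-\infty}^{\infty} dx'\,x'^2\,|\langle x'|\alpha\rangle|^2\\
 &= \left(\frac{1}{\sqrt{\pi}\,d}\right)\int_{-\infty}^{\infty} dx'\,x'^2 \exp\left[\frac{-x'^2}{d^2}\right]\\
 &= \frac{d^2}{2},
\end{aligned} \tag{1.7.37}
\]

which leads to

\[ \langle(\Delta x)^2\rangle = \langle x^2\rangle - \langle x\rangle^2 = \frac{d^2}{2} \tag{1.7.38} \]

for the dispersion of the position operator. The expectation values of $p$ and $p^2$ can
also be computed as follows:

\[ \langle p\rangle = \hbar k \tag{1.7.39a} \]

\[ \langle p^2\rangle = \frac{\hbar^2}{2d^2} + \hbar^2 k^2, \tag{1.7.39b} \]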







which is left as an exercise. The momentum dispersion is therefore given by

\[ \langle(\Delta p)^2\rangle = \langle p^2\rangle - \langle p\rangle^2 = \frac{\hbar^2}{2d^2}. \tag{1.7.40} \]

Armed with (1.7.38) and (1.7.40), we can check the Heisenberg uncertainty
relation (1.6.34); in this case the uncertainty product is given by

\[ \langle(\Delta x)^2\rangle\,\langle(\Delta p)^2\rangle = \frac{\hbar^2}{4}, \tag{1.7.41} \]

independent of d, so for a Gaussian wave packet we actually have an equality
relation rather than the more general inequality relation (1.6.34). For this reason
a Gaussian wave packet is often called a minimum uncertainty wave packet.
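The moments quoted in (1.7.37) through (1.7.41) can be checked numerically by applying (1.7.19) and (1.7.21) on a grid. The following Python sketch is an illustration under stated assumptions: units with $\hbar = 1$, a uniform grid cut off at ten widths, and central finite differences for the derivatives. The function names and the sample values of k and d are ours, not the text's.

```python
import cmath
import math

HBAR = 1.0  # units with hbar = 1 (an assumption made for the illustration)

def gaussian_packet(x, k, d):
    """Wave function (1.7.35): plane wave exp(ikx) under a Gaussian envelope."""
    norm = 1.0 / (math.pi ** 0.25 * math.sqrt(d))
    return norm * cmath.exp(1j * k * x - x * x / (2.0 * d * d))

def packet_moments(k, d, half_width=10.0, n=8001):
    """Return (<x>, <x^2>, <p>, <p^2>) from Riemann sums on a uniform grid.

    p and p^2 act as -i*hbar*d/dx and -hbar^2*d^2/dx^2, approximated here
    by central finite differences.
    """
    h = 2.0 * half_width * d / (n - 1)
    xs = [-half_width * d + i * h for i in range(n)]
    psi = [gaussian_packet(x, k, d) for x in xs]
    x1 = x2 = p1 = p2 = 0.0
    for i in range(1, n - 1):
        dpsi = (psi[i + 1] - psi[i - 1]) / (2.0 * h)
        d2psi = (psi[i + 1] - 2.0 * psi[i] + psi[i - 1]) / (h * h)
        rho = abs(psi[i]) ** 2
        x1 += xs[i] * rho * h
        x2 += xs[i] ** 2 * rho * h
        p1 += (psi[i].conjugate() * (-1j * HBAR) * dpsi).real * h
        p2 += (psi[i].conjugate() * (-HBAR ** 2) * d2psi).real * h
    return x1, x2, p1, p2

k, d = 2.0, 1.5  # arbitrary sample values
x1, x2, p1, p2 = packet_moments(k, d)
product = (x2 - x1 ** 2) * (p2 - p1 ** 2)
# Compare with d^2/2, hbar*k, hbar^2/(2 d^2) + hbar^2 k^2, and hbar^2/4.
print(x2, p1, p2, product)
```

Changing d rescales the two dispersions in opposite directions, while the product stays close to $\hbar^2/4$ up to discretization error.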

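We now go to momentum space. By a straightforward integration, just
completing the square in the exponent, we obtain

\[
\begin{aligned}
\langle p'|\alpha\rangle &= \left(\frac{1}{\sqrt{2\pi\hbar}}\right)\left(\frac{1}{\pi^{1/4}\sqrt{d}}\right)\int_{-\infty}^{\infty} dx'\,\exp\left(\frac{-i p' x'}{\hbar} + ikx' - \frac{x'^2}{2d^2}\right)\\
 &= \sqrt{\frac{d}{\hbar\sqrt{\pi}}}\,\exp\left[\frac{-(p' - \hbar k)^2 d^2}{2\hbar^2}\right].
\end{aligned} \tag{1.7.42}
\]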



This momentum-space wave function provides an alternative method for obtaining
$\langle p\rangle$ and $\langle p^2\rangle$, which is also left as an exercise.
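The closed form (1.7.42) can be compared with a direct numerical evaluation of the transform (1.7.34b). The sketch below again assumes units with $\hbar = 1$ and a grid cut off at ten widths; the function names and sample values are hypothetical choices made for the illustration.

```python
import cmath
import math

HBAR = 1.0  # units with hbar = 1 (an assumption made for the illustration)

def gaussian_packet(x, k, d):
    """Position-space wave function (1.7.35)."""
    norm = 1.0 / (math.pi ** 0.25 * math.sqrt(d))
    return norm * cmath.exp(1j * k * x - x * x / (2.0 * d * d))

def momentum_amplitude(p, k, d, half_width=10.0, n=4001):
    """<p'|alpha> from (1.7.34b), approximated by a Riemann sum over x'."""
    h = 2.0 * half_width * d / (n - 1)
    total = 0.0
    for i in range(n):
        x = -half_width * d + i * h
        total += cmath.exp(-1j * p * x / HBAR) * gaussian_packet(x, k, d) * h
    return total / math.sqrt(2.0 * math.pi * HBAR)

def closed_form(p, k, d):
    """Right-hand side of (1.7.42): a real Gaussian centered on hbar*k."""
    return math.sqrt(d / (HBAR * math.sqrt(math.pi))) * math.exp(
        -((p - HBAR * k) ** 2) * d * d / (2.0 * HBAR ** 2))

k, d = 2.0, 1.5  # arbitrary sample values
for p in (1.0, 2.0, 2.7):
    # The two columns should agree; the numerical imaginary part is negligible.
    print(p, momentum_amplitude(p, k, d), closed_form(p, k, d))
```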

The probability of finding the particle with momentum $p'$ is Gaussian (in
momentum space) centered on $\hbar k$, just as the probability of finding the particle at $x'$
is Gaussian (in position space) centered on zero. Furthermore, the widths of the
two Gaussians are inversely proportional to each other, which is just another way
of expressing the constancy of the uncertainty product $\langle(\Delta x)^2\rangle\langle(\Delta p)^2\rangle$ explicitly
computed in (1.7.41). The wider the spread in the p-space, the narrower the spread
in the x-space, and vice versa.

As an extreme example, suppose we let $d \to \infty$. The position-space wave
function (1.7.35) then becomes a plane wave extending over all space; the
probability of finding the particle is just constant, independent of $x'$. In contrast, the
momentum-space wave function is $\delta$-function-like and is sharply peaked at $\hbar k$. In
the opposite extreme, by letting $d \to 0$, we obtain a position-space wave function
localized like the $\delta$-function, but the momentum-space wave function (1.7.42) is
just constant, independent of $p'$.

We have seen that an extremely well localized (in the x-space) state is to be
regarded as a superposition of momentum eigenstates with all possible values of
momenta. Even those momentum eigenstates whose momenta are comparable to
or exceed mc must be included in the superposition. However, at such high values
of momentum, a description based on nonrelativistic quantum mechanics is bound






to break down.* Despite this limitation, our formalism, based on the existence of 
the position eigenket $|x'\rangle$, has a wide domain of applicability.

Generalization to Three Dimensions 

So far in this section we have worked exclusively in one-space for simplicity, 
but everything we have done can be generalized to three-space, if the necessary 
changes are made. The base kets to be used can be taken as either the position 
eigenkets satisfying 



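\[ \mathbf{x}|\mathbf{x}'\rangle = \mathbf{x}'|\mathbf{x}'\rangle \tag{1.7.43} \]

or the momentum eigenkets satisfying

\[ \mathbf{p}|\mathbf{p}'\rangle = \mathbf{p}'|\mathbf{p}'\rangle. \tag{1.7.44} \]

They obey the normalization conditions

\[ \langle\mathbf{x}'|\mathbf{x}''\rangle = \delta^3(\mathbf{x}' - \mathbf{x}'') \tag{1.7.45a} \]

and

\[ \langle\mathbf{p}'|\mathbf{p}''\rangle = \delta^3(\mathbf{p}' - \mathbf{p}''), \tag{1.7.45b} \]

where $\delta^3$ stands for the three-dimensional $\delta$-function

\[ \delta^3(\mathbf{x}' - \mathbf{x}'') = \delta(x' - x'')\,\delta(y' - y'')\,\delta(z' - z''). \tag{1.7.46} \]

The completeness relations read

\[ \int d^3x'\,|\mathbf{x}'\rangle\langle\mathbf{x}'| = 1 \tag{1.7.47a} \]

and

\[ \int d^3p'\,|\mathbf{p}'\rangle\langle\mathbf{p}'| = 1, \tag{1.7.47b} \]

which can be used to expand an arbitrary state ket:

\[ |\alpha\rangle = \int d^3x'\,|\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha\rangle, \tag{1.7.48a} \]

\[ |\alpha\rangle = \int d^3p'\,|\mathbf{p}'\rangle\langle\mathbf{p}'|\alpha\rangle. \tag{1.7.48b} \]

The expansion coefficients $\langle\mathbf{x}'|\alpha\rangle$ and $\langle\mathbf{p}'|\alpha\rangle$ are identified with the wave functions
$\psi_\alpha(\mathbf{x}')$ and $\phi_\alpha(\mathbf{p}')$ in position and momentum space, respectively.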

*It turns out that the concept of a localized state in relativistic quantum mechanics is far more
intricate because of the possibility of “negative energy states,” or pair creation. See Chapter 8 of
this book.







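The momentum operator, when taken between $|\beta\rangle$ and $|\alpha\rangle$, becomes

\[ \langle\beta|\mathbf{p}|\alpha\rangle = \int d^3x'\,\psi_\beta^*(\mathbf{x}')\,(-i\hbar\nabla')\,\psi_\alpha(\mathbf{x}'). \tag{1.7.49} \]

The transformation function analogous to (1.7.32) is

\[ \langle\mathbf{x}'|\mathbf{p}'\rangle = \left[\frac{1}{(2\pi\hbar)^{3/2}}\right]\exp\left(\frac{i\mathbf{p}'\cdot\mathbf{x}'}{\hbar}\right), \tag{1.7.50} \]

so that

\[ \psi_\alpha(\mathbf{x}') = \left[\frac{1}{(2\pi\hbar)^{3/2}}\right]\int d^3p'\,\exp\left(\frac{i\mathbf{p}'\cdot\mathbf{x}'}{\hbar}\right)\phi_\alpha(\mathbf{p}') \tag{1.7.51a} \]

and

\[ \phi_\alpha(\mathbf{p}') = \left[\frac{1}{(2\pi\hbar)^{3/2}}\right]\int d^3x'\,\exp\left(\frac{-i\mathbf{p}'\cdot\mathbf{x}'}{\hbar}\right)\psi_\alpha(\mathbf{x}'). \tag{1.7.51b} \]

It is interesting to check the dimension of the wave functions. In
one-dimensional problems the normalization requirement (1.6.8) implies that $|\langle x'|\alpha\rangle|^2$
has the dimension of inverse length, so the wave function itself must have the
dimension of $(\text{length})^{-1/2}$. In contrast, the wave function in three-dimensional
problems must have the dimension of $(\text{length})^{-3/2}$ because $|\langle\mathbf{x}'|\alpha\rangle|^2$ integrated
over all spatial volume must be unity (dimensionless).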



Problems 

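1.1 Prove

\[ [AB, CD] = -AC\{D,B\} + A\{C,B\}D - C\{D,A\}B + \{C,A\}DB. \]

1.2 Suppose a 2 x 2 matrix X (not necessarily Hermitian or unitary) is written as

\[ X = a_0 + \boldsymbol{\sigma}\cdot\mathbf{a}, \]

where $a_0$ and $a_{1,2,3}$ are numbers.

(a) How are $a_0$ and $a_k$ ($k = 1,2,3$) related to $\mathrm{tr}(X)$ and $\mathrm{tr}(\sigma_k X)$?

(b) Obtain $a_0$ and $a_k$ in terms of the matrix elements $X_{ij}$.

1.3 Show that the determinant of a 2 x 2 matrix $\boldsymbol{\sigma}\cdot\mathbf{a}$ is invariant under

\[ \boldsymbol{\sigma}\cdot\mathbf{a} \to \boldsymbol{\sigma}\cdot\mathbf{a}' \equiv \exp\left(\frac{i\boldsymbol{\sigma}\cdot\hat{\mathbf{n}}\phi}{2}\right)\boldsymbol{\sigma}\cdot\mathbf{a}\,\exp\left(\frac{-i\boldsymbol{\sigma}\cdot\hat{\mathbf{n}}\phi}{2}\right). \]

Find $a_k'$ in terms of $a_k$ when $\hat{\mathbf{n}}$ is in the positive z-direction, and interpret your result.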







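1.4 Using the rules of bra-ket algebra, prove or evaluate the following:

(a) $\mathrm{tr}(XY) = \mathrm{tr}(YX)$, where X and Y are operators.

(b) $(XY)^\dagger = Y^\dagger X^\dagger$, where X and Y are operators.

(c) $\exp[if(A)] = ?$ in ket-bra form, where A is a Hermitian operator whose
eigenvalues are known.

(d) $\sum_{a'} \psi_{a'}^*(\mathbf{x}')\,\psi_{a'}(\mathbf{x}'')$, where $\psi_{a'}(\mathbf{x}') = \langle\mathbf{x}'|a'\rangle$.

1.5 (a) Consider two kets $|\alpha\rangle$ and $|\beta\rangle$. Suppose $\langle a'|\alpha\rangle, \langle a''|\alpha\rangle, \ldots$ and $\langle a'|\beta\rangle,
\langle a''|\beta\rangle, \ldots$ are all known, where $|a'\rangle, |a''\rangle, \ldots$ form a complete set of base
kets. Find the matrix representation of the operator $|\alpha\rangle\langle\beta|$ in that basis.

(b) We now consider a spin $\frac{1}{2}$ system and let $|\alpha\rangle$ and $|\beta\rangle$ be $|s_z = \hbar/2\rangle$ and
$|s_x = \hbar/2\rangle$, respectively. Write down explicitly the square matrix that
corresponds to $|\alpha\rangle\langle\beta|$ in the usual ($s_z$ diagonal) basis.

1.6 Suppose $|i\rangle$ and $|j\rangle$ are eigenkets of some Hermitian operator A. Under what
condition can we conclude that $|i\rangle + |j\rangle$ is also an eigenket of A? Justify your answer.

1.7 Consider a ket space spanned by the eigenkets $\{|a'\rangle\}$ of a Hermitian operator A.
There is no degeneracy.

(a) Prove that

\[ \prod_{a'} (A - a') \]

is the null operator.

(b) Explain the significance of

\[ \prod_{a'' \ne a'} \frac{(A - a'')}{(a' - a'')}. \]

(c) Illustrate (a) and (b) using A set equal to $S_z$ of a spin $\frac{1}{2}$ system.

1.8 Using the orthonormality of $|+\rangle$ and $|-\rangle$, prove

\[ [S_i, S_j] = i\varepsilon_{ijk}\hbar S_k, \quad \{S_i, S_j\} = \left(\frac{\hbar^2}{2}\right)\delta_{ij}, \]

where

\[ S_x = \frac{\hbar}{2}\left(|+\rangle\langle-| + |-\rangle\langle+|\right), \quad S_y = \frac{i\hbar}{2}\left(-|+\rangle\langle-| + |-\rangle\langle+|\right), \]

\[ S_z = \frac{\hbar}{2}\left(|+\rangle\langle+| - |-\rangle\langle-|\right). \]

1.9 Construct $|\mathbf{S}\cdot\hat{\mathbf{n}};+\rangle$ such that

\[ \mathbf{S}\cdot\hat{\mathbf{n}}\,|\mathbf{S}\cdot\hat{\mathbf{n}};+\rangle = \left(\frac{\hbar}{2}\right)|\mathbf{S}\cdot\hat{\mathbf{n}};+\rangle, \]

where $\hat{\mathbf{n}}$ is characterized by the angles shown in the accompanying figure. Express
your answer as a linear combination of $|+\rangle$ and $|-\rangle$. [Note: The answer is

\[ \cos\left(\frac{\beta}{2}\right)|+\rangle + \sin\left(\frac{\beta}{2}\right)e^{i\alpha}|-\rangle. \]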







But do not just verify that this answer satisfies the above eigenvalue equation. 
Rather, treat the problem as a straightforward eigenvalue problem. Also, do not use 
rotation operators, which we will introduce later in this book.] 




1.10 The Hamiltonian operator for a two-state system is given by

\[ H = a\left(|1\rangle\langle 1| - |2\rangle\langle 2| + |1\rangle\langle 2| + |2\rangle\langle 1|\right), \]

where a is a number with the dimension of energy. Find the energy eigenvalues and
the corresponding energy eigenkets (as linear combinations of $|1\rangle$ and $|2\rangle$).

1.11 A two-state system is characterized by the Hamiltonian

\[ H = H_{11}|1\rangle\langle 1| + H_{22}|2\rangle\langle 2| + H_{12}\left[|1\rangle\langle 2| + |2\rangle\langle 1|\right], \]

where $H_{11}$, $H_{22}$, and $H_{12}$ are real numbers with the dimension of energy, and $|1\rangle$
and $|2\rangle$ are eigenkets of some observable ($\ne H$). Find the energy eigenkets and the
corresponding energy eigenvalues. Make sure that your answer makes good sense
for $H_{12} = 0$. (You need not solve this problem from scratch. The following fact
may be used without proof:

\[ (\mathbf{S}\cdot\hat{\mathbf{n}})\,|\hat{\mathbf{n}};+\rangle = \frac{\hbar}{2}|\hat{\mathbf{n}};+\rangle, \]

with $|\hat{\mathbf{n}};+\rangle$ given by

\[ |\hat{\mathbf{n}};+\rangle = \cos\frac{\beta}{2}|+\rangle + e^{i\alpha}\sin\frac{\beta}{2}|-\rangle, \]

where $\beta$ and $\alpha$ are the polar and azimuthal angles, respectively, that characterize $\hat{\mathbf{n}}$.
The angles are defined in the figure following Problem 1.9.)

1.12 A spin $\frac{1}{2}$ system is known to be in an eigenstate of $\mathbf{S}\cdot\hat{\mathbf{n}}$ with eigenvalue $\hbar/2$, where
$\hat{\mathbf{n}}$ is a unit vector lying in the xz-plane that makes an angle $\gamma$ with the positive
z-axis.

(a) Suppose $S_x$ is measured. What is the probability of getting $+\hbar/2$?







(b) Evaluate the dispersion in $S_x$, that is,

\[ \langle(S_x - \langle S_x\rangle)^2\rangle. \]

(For your own peace of mind, check your answers for the special cases $\gamma = 0$,
$\pi/2$, and $\pi$.)

1.13 A beam of spin $\frac{1}{2}$ atoms goes through a series of Stern-Gerlach-type measurements
as follows:

(a) The first measurement accepts $s_z = \hbar/2$ atoms and rejects $s_z = -\hbar/2$ atoms.

(b) The second measurement accepts $s_n = \hbar/2$ atoms and rejects $s_n = -\hbar/2$ atoms,
where $s_n$ is the eigenvalue of the operator $\mathbf{S}\cdot\hat{\mathbf{n}}$, with $\hat{\mathbf{n}}$ making an angle $\beta$ in
the xz-plane with respect to the z-axis.

(c) The third measurement accepts $s_z = -\hbar/2$ atoms and rejects $s_z = \hbar/2$ atoms.

What is the intensity of the final $s_z = -\hbar/2$ beam when the $s_z = \hbar/2$ beam
surviving the first measurement is normalized to unity? How must we orient the second
measuring apparatus if we are to maximize the intensity of the final $s_z = -\hbar/2$
beam?

1.14 A certain observable in quantum mechanics has a 3 x 3 matrix representation as
follows:

\[ \frac{1}{\sqrt{2}}\begin{pmatrix} 0 & 1 & 0\\ 1 & 0 & 1\\ 0 & 1 & 0 \end{pmatrix}. \]

(a) Find the normalized eigenvectors of this observable and the corresponding
eigenvalues. Is there any degeneracy?

(b) Give a physical example where all this is relevant.

1.15 Let A and B be observables. Suppose the simultaneous eigenkets of A and B
$\{|a', b'\rangle\}$ form a complete orthonormal set of base kets. Can we always conclude
that

\[ [A, B] = 0? \]

If your answer is yes, prove the assertion. If your answer is no, give a
counterexample.

1.16 Two Hermitian operators anticommute:

\[ \{A, B\} = AB + BA = 0. \]

Is it possible to have a simultaneous (that is, common) eigenket of A and B? Prove
or illustrate your assertion.

1.17 Two observables $A_1$ and $A_2$, which do not involve time explicitly, are known not
to commute,

\[ [A_1, A_2] \ne 0, \]

yet we also know that $A_1$ and $A_2$ both commute with the Hamiltonian:

\[ [A_1, H] = 0, \quad [A_2, H] = 0. \]







Prove that the energy eigenstates are, in general, degenerate. Are there exceptions?
As an example, you may think of the central-force problem $H = \mathbf{p}^2/2m + V(r)$,
with $A_1 \to L_z$, $A_2 \to L_x$.

1.18 (a) The simplest way to derive the Schwarz inequality goes as follows. First,
observe

\[ (\langle\alpha| + \lambda^*\langle\beta|)\cdot(|\alpha\rangle + \lambda|\beta\rangle) \ge 0 \]

for any complex number $\lambda$; then choose $\lambda$ in such a way that the preceding
inequality reduces to the Schwarz inequality.

(b) Show that the equality sign in the generalized uncertainty relation holds if the
state in question satisfies

\[ \Delta A|\alpha\rangle = \lambda\,\Delta B|\alpha\rangle \]

with $\lambda$ purely imaginary.

(c) Explicit calculations using the usual rules of wave mechanics show that the
wave function for a Gaussian wave packet given by

\[ \langle x'|\alpha\rangle = (2\pi d^2)^{-1/4}\exp\left[\frac{i\langle p\rangle x'}{\hbar} - \frac{(x' - \langle x\rangle)^2}{4d^2}\right] \]

satisfies the minimum uncertainty relation

\[ \sqrt{\langle(\Delta x)^2\rangle}\,\sqrt{\langle(\Delta p)^2\rangle} = \frac{\hbar}{2}. \]

Prove that the requirement

\[ \langle x'|\Delta x|\alpha\rangle = (\text{imaginary number})\,\langle x'|\Delta p|\alpha\rangle \]

is indeed satisfied for such a Gaussian wave packet, in agreement with (b).

1.19 (a) Compute

\[ \langle(\Delta S_x)^2\rangle \equiv \langle S_x^2\rangle - \langle S_x\rangle^2, \]

where the expectation value is taken for the $S_z+$ state. Using your result, check
the generalized uncertainty relation

\[ \langle(\Delta A)^2\rangle\,\langle(\Delta B)^2\rangle \ge \frac{1}{4}\,|\langle[A, B]\rangle|^2, \]

with $A \to S_x$, $B \to S_y$.

(b) Check the uncertainty relation with $A \to S_x$, $B \to S_y$ for the $S_x+$ state.

1.20 Find the linear combination of $|+\rangle$ and $|-\rangle$ kets that maximizes the uncertainty
product

\[ \langle(\Delta S_x)^2\rangle\,\langle(\Delta S_y)^2\rangle. \]

Verify explicitly that for the linear combination you found, the uncertainty relation
for $S_x$ and $S_y$ is not violated.







1.21 Evaluate the x-p uncertainty product $\langle(\Delta x)^2\rangle\langle(\Delta p)^2\rangle$ for a one-dimensional particle
confined between two rigid walls,

\[ V = \begin{cases} 0 & \text{for } 0 < x < a,\\ \infty & \text{otherwise.} \end{cases} \]

Do this for both the ground and excited states.

1.22 Estimate the rough order of magnitude of the length of time that an ice pick can be
balanced on its point if the only limitation is that set by the Heisenberg uncertainty
principle. Assume that the point is sharp and that the point and the surface on
which it rests are hard. You may make approximations that do not alter the general
order of magnitude of the result. Assume reasonable values for the dimensions and
weight of the ice pick. Obtain an approximate numerical result and express it in
seconds.

1.23 Consider a three-dimensional ket space. If a certain set of orthonormal kets, say,
$|1\rangle$, $|2\rangle$, and $|3\rangle$, are used as the base kets, the operators A and B are represented
by

\[ A \doteq \begin{pmatrix} a & 0 & 0\\ 0 & -a & 0\\ 0 & 0 & -a \end{pmatrix}, \quad B \doteq \begin{pmatrix} b & 0 & 0\\ 0 & 0 & -ib\\ 0 & ib & 0 \end{pmatrix} \]

with a and b both real.

(a) Obviously A exhibits a degenerate spectrum. Does B also exhibit a degenerate
spectrum?

(b) Show that A and B commute.

(c) Find a new set of orthonormal kets that are simultaneous eigenkets of both A
and B. Specify the eigenvalues of A and B for each of the three eigenkets. Does
your specification of eigenvalues completely characterize each eigenket?

1.24 (a) Prove that $(1/\sqrt{2})(1 + i\sigma_x)$ acting on a two-component spinor can be regarded
as the matrix representation of the rotation operator about the x-axis by angle
$-\pi/2$. (The minus sign signifies that the rotation is clockwise.)

(b) Construct the matrix representation of $S_z$ when the eigenkets of $S_y$ are used as
base vectors.

1.25 Some authors define an operator to be real when every member of its matrix
elements $\langle b'|A|b''\rangle$ is real in some representation ($\{|b'\rangle\}$ basis in this case). Is this
concept representation independent? That is, do the matrix elements remain real
even if some basis other than $\{|b'\rangle\}$ is used? Check your assertion using familiar
operators such as $S_y$ and $S_z$ (see Problem 1.24) or x and $p_x$.

1.26 Construct the transformation matrix that connects the $S_z$ diagonal basis to the $S_x$
diagonal basis. Show that your result is consistent with the general relation

\[ U = \sum_r |b^{(r)}\rangle\langle a^{(r)}|. \]

1.27 (a) Suppose that $f(A)$ is a function of a Hermitian operator A with the property
$A|a'\rangle = a'|a'\rangle$. Evaluate $\langle b''|f(A)|b'\rangle$ when the transformation matrix from the
$a'$ basis to the $b'$ basis is known.







(b) Using the continuum analogue of the result obtained in (a), evaluate

\[ \langle\mathbf{p}''|F(r)|\mathbf{p}'\rangle. \]

Simplify your expression as far as you can. Note that r is $\sqrt{x^2 + y^2 + z^2}$, where
x, y, and z are operators.

1.28 (a) Let x and $p_x$ be the coordinate and the linear momentum in one
dimension. Evaluate the classical Poisson bracket

\[ [x, F(p_x)]_{\text{classical}}. \]

(b) Let x and $p_x$ be the corresponding quantum-mechanical operators this time.
Evaluate the commutator

\[ \left[x, \exp\left(\frac{i p_x a}{\hbar}\right)\right]. \]

(c) Using the result obtained in (b), prove that

\[ \exp\left(\frac{i p_x a}{\hbar}\right)|x'\rangle, \quad (x|x'\rangle = x'|x'\rangle) \]

is an eigenstate of the coordinate operator x. What is the corresponding
eigenvalue?

1.29 (a) On page 247, Gottfried (1966) states that

\[ [x_i, G(\mathbf{p})] = i\hbar\,\frac{\partial G}{\partial p_i}, \quad [p_i, F(\mathbf{x})] = -i\hbar\,\frac{\partial F}{\partial x_i} \]

can be “easily derived” from the fundamental commutation relations for all
functions of F and G that can be expressed as power series in their arguments.
Verify this statement.

(b) Evaluate $[x^2, p^2]$. Compare your result with the classical Poisson bracket
$[x^2, p^2]_{\text{classical}}$.

1.30 The translation operator for a finite (spatial) displacement is given by

\[ \mathscr{T}(\mathbf{l}) = \exp\left(\frac{-i\mathbf{p}\cdot\mathbf{l}}{\hbar}\right), \]

where $\mathbf{p}$ is the momentum operator.

(a) Evaluate

\[ [x_i, \mathscr{T}(\mathbf{l})]. \]

(b) Using (a) (or otherwise), demonstrate how the expectation value $\langle\mathbf{x}\rangle$ changes
under translation.

1.31 In the main text we discussed the effect of $\mathscr{T}(d\mathbf{x}')$ on the position and
momentum eigenkets and on a more general state ket $|\alpha\rangle$. We can also study the behavior
of expectation values $\langle\mathbf{x}\rangle$ and $\langle\mathbf{p}\rangle$ under infinitesimal translation. Using (1.6.25),
(1.6.45), and $|\alpha\rangle \to \mathscr{T}(d\mathbf{x}')|\alpha\rangle$ only, prove $\langle\mathbf{x}\rangle \to \langle\mathbf{x}\rangle + d\mathbf{x}'$, $\langle\mathbf{p}\rangle \to \langle\mathbf{p}\rangle$ under
infinitesimal translation.







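1.32 (a) Verify (1.7.39a) and (1.7.39b) for the expectation value of $p$ and $p^2$ from the
Gaussian wave packet (1.7.35).

(b) Evaluate the expectation value of $p$ and $p^2$ using the momentum-space wave
function (1.7.42).

1.33 (a) Prove the following:

i. $\langle p'|x|\alpha\rangle = i\hbar\,\dfrac{\partial}{\partial p'}\langle p'|\alpha\rangle$,

ii. $\langle\beta|x|\alpha\rangle = \displaystyle\int dp'\,\phi_\beta^*(p')\,i\hbar\,\frac{\partial}{\partial p'}\,\phi_\alpha(p')$,

where $\phi_\alpha(p') = \langle p'|\alpha\rangle$ and $\phi_\beta(p') = \langle p'|\beta\rangle$ are momentum-space wave
functions.

(b) What is the physical significance of

\[ \exp\left(\frac{i x\,\Xi}{\hbar}\right), \]

where x is the position operator and $\Xi$ is some number with the dimension of
momentum? Justify your answer.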




CHAPTER 2

Quantum Dynamics



So far we have not discussed how physical systems change with time. This
chapter is devoted exclusively to the dynamic development of state kets and/or
observables. In other words, we are concerned here with the quantum mechanical
analogue of Newton’s (or Lagrange’s or Hamilton’s) equations of motion.



2.1 ■ TIME-EVOLUTION AND THE SCHRÖDINGER EQUATION

The first important point we should keep in mind is that time is just a parameter in quantum mechanics, not an operator. In particular, time is not an observable in the language of the previous chapter. It is nonsensical to talk about the time operator in the same sense as we talk about the position operator. Ironically, in the historical development of wave mechanics both L. de Broglie and E. Schrödinger were guided by a kind of covariant analogy between energy and time on the one hand and momentum and position (spatial coordinate) on the other. Yet when we now look at quantum mechanics in its finished form, there is no trace of a symmetrical treatment between time and space. The relativistic quantum theory of fields does treat the time and space coordinates on the same footing, but it does so only at the expense of demoting position from the status of being an observable to that of being just a parameter.

Time-Evolution Operator 

Our basic concern in this section is, How does a state ket change with time? Suppose we have a physical system whose state ket at $t_0$ is represented by $|\alpha\rangle$. At later times, we do not, in general, expect the system to remain in the same state $|\alpha\rangle$. Let us denote the ket corresponding to the state at some later time by

$$|\alpha, t_0; t\rangle, \qquad (t > t_0), \tag{2.1.1}$$

where we have written $\alpha, t_0$ to remind ourselves that the system used to be in state $|\alpha\rangle$ at some earlier reference time $t_0$. Because time is assumed to be a continuous parameter, we expect

$$\lim_{t \to t_0} |\alpha, t_0; t\rangle = |\alpha\rangle, \tag{2.1.2}$$

and we may as well use a shorthand notation,

$$|\alpha, t_0; t_0\rangle = |\alpha, t_0\rangle, \tag{2.1.3}$$

for this. Our basic task is to study the time evolution of a state ket:

$$|\alpha, t_0\rangle = |\alpha\rangle \xrightarrow{\ \text{time evolution}\ } |\alpha, t_0; t\rangle. \tag{2.1.4}$$



To put it another way, we are interested in asking how the state ket changes under a time displacement $t_0 \to t$.

As in the case of translation, the two kets are related by an operator which we call the time-evolution operator $\mathcal{U}(t, t_0)$:

$$|\alpha, t_0; t\rangle = \mathcal{U}(t, t_0)|\alpha, t_0\rangle. \tag{2.1.5}$$

What are some of the properties we would like to ascribe to the time-evolution operator? The first important property is the unitary requirement for $\mathcal{U}(t, t_0)$ that follows from probability conservation. Suppose that at $t_0$ the state ket is expanded in terms of the eigenkets of some observable $A$:

$$|\alpha, t_0\rangle = \sum_{a'} c_{a'}(t_0)|a'\rangle. \tag{2.1.6}$$

Likewise, at some later time, we have

$$|\alpha, t_0; t\rangle = \sum_{a'} c_{a'}(t)|a'\rangle. \tag{2.1.7}$$

In general, we do not expect the modulus of the individual expansion coefficient to remain the same:*

$$|c_{a'}(t)| \neq |c_{a'}(t_0)|. \tag{2.1.8}$$

For instance, consider a spin $\tfrac{1}{2}$ system with its spin magnetic moment subjected to a uniform magnetic field in the $z$-direction. To be specific, suppose that at $t_0$ the spin is in the positive $x$-direction; that is, the system is found in an eigenstate of $S_x$ with eigenvalue $\hbar/2$. As time goes on, the spin precesses in the $xy$-plane, as will be quantitatively demonstrated later in this section. This means that the probability for observing $S_x+$ is no longer unity at $t > t_0$; there is a finite probability for observing $S_x-$ as well. Yet the sum of the probabilities for $S_x+$ and $S_x-$ remains unity at all times. Generally, in the notation of (2.1.6) and (2.1.7), we must have

$$\sum_{a'} |c_{a'}(t_0)|^2 = \sum_{a'} |c_{a'}(t)|^2 \tag{2.1.9}$$

despite (2.1.8) for the individual expansion coefficients. Stated another way, if the state ket is initially normalized to unity, it must remain normalized to unity at all later times:

$$\langle\alpha, t_0|\alpha, t_0\rangle = 1 \;\Rightarrow\; \langle\alpha, t_0; t|\alpha, t_0; t\rangle = 1. \tag{2.1.10}$$

*We later show, however, that if the Hamiltonian commutes with $A$, then $|c_{a'}(t)|$ is indeed equal to $|c_{a'}(t_0)|$.







As in the translation case, this property is guaranteed if the time-evolution operator is taken to be unitary. For this reason we take unitarity,

$$\mathcal{U}^\dagger(t, t_0)\,\mathcal{U}(t, t_0) = 1, \tag{2.1.11}$$

to be one of the fundamental properties of the $\mathcal{U}$ operator. It is no coincidence that many authors regard unitarity as synonymous with probability conservation.

Another feature we require of the $\mathcal{U}$ operator is the composition property:

$$\mathcal{U}(t_2, t_0) = \mathcal{U}(t_2, t_1)\,\mathcal{U}(t_1, t_0), \qquad (t_2 > t_1 > t_0). \tag{2.1.12}$$

This equation says that if we are interested in obtaining time evolution from $t_0$ to $t_2$, then we can obtain the same result by considering time evolution first from $t_0$ to $t_1$ and then from $t_1$ to $t_2$, a reasonable requirement. Note that we read (2.1.12) from right to left!

It also turns out to be advantageous to consider an infinitesimal time-evolution operator $\mathcal{U}(t_0 + dt, t_0)$:

$$|\alpha, t_0; t_0 + dt\rangle = \mathcal{U}(t_0 + dt, t_0)|\alpha, t_0\rangle. \tag{2.1.13}$$

Because of continuity [see (2.1.2)], the infinitesimal time-evolution operator must reduce to the identity operator as $dt$ goes to zero,

$$\lim_{dt \to 0} \mathcal{U}(t_0 + dt, t_0) = 1, \tag{2.1.14}$$

and, as in the translation case, we expect the difference between $\mathcal{U}(t_0 + dt, t_0)$ and 1 to be of first order in $dt$.

We assert that all these requirements are satisfied by

$$\mathcal{U}(t_0 + dt, t_0) = 1 - i\Omega\,dt, \tag{2.1.15}$$

where $\Omega$ is a Hermitian operator,*

$$\Omega^\dagger = \Omega. \tag{2.1.16}$$

With (2.1.15) the infinitesimal time-displacement operator satisfies the composition property

$$\mathcal{U}(t_0 + dt_1 + dt_2, t_0) = \mathcal{U}(t_0 + dt_1 + dt_2, t_0 + dt_1)\,\mathcal{U}(t_0 + dt_1, t_0); \tag{2.1.17}$$

it differs from the identity operator by a term of order $dt$. The unitarity property can also be checked as follows:

$$\mathcal{U}^\dagger(t_0 + dt, t_0)\,\mathcal{U}(t_0 + dt, t_0) = (1 + i\Omega^\dagger dt)(1 - i\Omega\,dt) \simeq 1, \tag{2.1.18}$$

to the extent that terms of order $(dt)^2$ or higher can be ignored.

The operator $\Omega$ has the dimension of frequency or inverse time. Is there any familiar observable with the dimension of frequency? We recall that in the old quantum theory, angular frequency $\omega$ is postulated to be related to energy by the Planck-Einstein relation

$$E = \hbar\omega. \tag{2.1.19}$$

*If the $\Omega$ operator depends on time explicitly, then it must be evaluated at $t_0$.

Let us now borrow from classical mechanics the idea that the Hamiltonian is the generator of time evolution (Goldstein 2002, pp. 401-2). It is then natural to relate $\Omega$ to the Hamiltonian operator $H$:

$$\Omega = \frac{H}{\hbar}. \tag{2.1.20}$$

To sum up, the infinitesimal time-evolution operator is written as

$$\mathcal{U}(t_0 + dt, t_0) = 1 - \frac{iH\,dt}{\hbar}, \tag{2.1.21}$$

where $H$, the Hamiltonian operator, is assumed to be Hermitian. The reader may ask whether the $\hbar$ introduced here is the same as the $\hbar$ that appears in the expression for the translation operator (1.6.32). This question can be answered by comparing the quantum-mechanical equation of motion we derive later with the classical equation of motion. It turns out that unless the two $\hbar$’s are taken to be the same, we are unable to obtain a relation like

$$\frac{d\mathbf{x}}{dt} = \frac{\mathbf{p}}{m} \tag{2.1.22}$$

as the classical limit of the corresponding quantum-mechanical relation.



The Schrödinger Equation

We are now in a position to derive the fundamental differential equation for the time-evolution operator $\mathcal{U}(t, t_0)$. We exploit the composition property of the time-evolution operator by letting $t_1 \to t$, $t_2 \to t + dt$ in (2.1.12):

$$\mathcal{U}(t + dt, t_0) = \mathcal{U}(t + dt, t)\,\mathcal{U}(t, t_0) = \left(1 - \frac{iH\,dt}{\hbar}\right)\mathcal{U}(t, t_0), \tag{2.1.23}$$

where the time difference $t - t_0$ need not be infinitesimal. We have

$$\mathcal{U}(t + dt, t_0) - \mathcal{U}(t, t_0) = -i\left(\frac{H}{\hbar}\right)dt\,\mathcal{U}(t, t_0), \tag{2.1.24}$$

which can be written in differential equation form:

$$i\hbar\frac{\partial}{\partial t}\,\mathcal{U}(t, t_0) = H\,\mathcal{U}(t, t_0). \tag{2.1.25}$$

This is the Schrödinger equation for the time-evolution operator. Everything that has to do with time development follows from this fundamental equation.







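Equation (2.1.25) immediately leads to the Schrödinger equation for a state ket. Multiplying both sides of (2.1.25) by $|\alpha, t_0\rangle$ on the right, we obtain

$$i\hbar\frac{\partial}{\partial t}\,\mathcal{U}(t, t_0)|\alpha, t_0\rangle = H\,\mathcal{U}(t, t_0)|\alpha, t_0\rangle. \tag{2.1.26}$$

But $|\alpha, t_0\rangle$ does not depend on $t$, so this is the same as

$$i\hbar\frac{\partial}{\partial t}|\alpha, t_0; t\rangle = H|\alpha, t_0; t\rangle, \tag{2.1.27}$$

where (2.1.5) has been used.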

If we are given $\mathcal{U}(t, t_0)$ and, in addition, know how $\mathcal{U}(t, t_0)$ acts on the initial state ket $|\alpha, t_0\rangle$, it is not necessary to bother with the Schrödinger equation for the state ket (2.1.27). All we have to do is apply $\mathcal{U}(t, t_0)$ to $|\alpha, t_0\rangle$; in this manner we can obtain a state ket at any $t$. Our first task is therefore to derive formal solutions to the Schrödinger equation for the time-evolution operator (2.1.25). There are three cases to be treated separately:

Case 1. The Hamiltonian operator is independent of time. By this we mean that even when the parameter $t$ is changed, the $H$ operator remains unchanged. The Hamiltonian for a spin-magnetic moment interacting with a time-independent magnetic field is an example of this. The solution to (2.1.25) in such a case is given by

$$\mathcal{U}(t, t_0) = \exp\left[\frac{-iH(t - t_0)}{\hbar}\right]. \tag{2.1.28}$$



To prove this, let us expand the exponential as follows:

$$\exp\left[\frac{-iH(t - t_0)}{\hbar}\right] = 1 + \frac{-iH(t - t_0)}{\hbar} + \left[\frac{(-i)^2}{2}\right]\left[\frac{H(t - t_0)}{\hbar}\right]^2 + \cdots. \tag{2.1.29}$$

Because the time derivative of this expansion is given by

$$\frac{\partial}{\partial t}\exp\left[\frac{-iH(t - t_0)}{\hbar}\right] = \frac{-iH}{\hbar} + \left[\frac{(-i)^2}{2}\right] 2\left(\frac{H}{\hbar}\right)^2 (t - t_0) + \cdots, \tag{2.1.30}$$

expression (2.1.28) obviously satisfies differential equation (2.1.25). The boundary condition is also satisfied because as $t \to t_0$, (2.1.28) reduces to the identity operator. An alternative way to obtain (2.1.28) is to compound successively infinitesimal time-evolution operators just as we did to obtain (1.6.36) for finite translation:

$$\lim_{N \to \infty}\left[1 - \frac{(iH/\hbar)(t - t_0)}{N}\right]^N = \exp\left[\frac{-iH(t - t_0)}{\hbar}\right]. \tag{2.1.31}$$
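The limit in (2.1.31) can be checked numerically on a single energy eigenvalue, where $H$ acts as the number $E$. The following is only a minimal sketch: natural units with $\hbar = 1$, the sample values of $E$ and $t$, and the function names `compounded_phase` and `exact_phase` are all illustrative assumptions, not part of the text.

```python
import cmath

HBAR = 1.0  # natural units; an assumption made purely for illustration


def compounded_phase(energy, t, n_steps):
    """Product of n_steps infinitesimal evolutions acting on an energy eigenket.

    On an eigenket H acts as the number `energy`, so each step of duration
    t / n_steps multiplies the ket by (1 - i E t / (hbar N)), as in (2.1.31).
    """
    step = 1 - 1j * energy * t / (HBAR * n_steps)
    return step ** n_steps


def exact_phase(energy, t):
    """The finite-time result exp(-i E t / hbar) from (2.1.28) with t0 = 0."""
    return cmath.exp(-1j * energy * t / HBAR)


if __name__ == "__main__":
    E, t = 1.3, 2.0  # hypothetical sample values
    for n in (10, 100, 10_000):
        error = abs(compounded_phase(E, t, n) - exact_phase(E, t))
        print(f"N = {n:6d}   |difference| = {error:.3e}")
```

The difference shrinks roughly like $1/N$, which is the expected behavior when terms of order $(dt)^2$ are dropped in each step.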



Case 2. The Hamiltonian operator $H$ is time-dependent but the $H$’s at different times commute. As an example, let us consider the spin-magnetic moment subjected to a magnetic field whose strength varies with time but whose direction is always unchanged. The formal solution to (2.1.25) in this case is

$$\mathcal{U}(t, t_0) = \exp\left[-\left(\frac{i}{\hbar}\right)\int_{t_0}^{t} dt'\,H(t')\right]. \tag{2.1.32}$$

This can be proved in a similar way. We simply replace $H(t - t_0)$ in (2.1.29) and (2.1.30) by $\int_{t_0}^{t} dt'\,H(t')$.

Case 3. The $H$’s at different times do not commute. Continuing with the example involving spin-magnetic moment, we suppose, this time, that the magnetic field direction also changes with time: at $t = t_1$ in the $x$-direction, at $t = t_2$ in the $y$-direction, and so forth. Because $S_x$ and $S_y$ do not commute, $H(t_1)$ and $H(t_2)$, which go like $\mathbf{S}\cdot\mathbf{B}$, do not commute either. The formal solution in such a situation is given by

$$\mathcal{U}(t, t_0) = 1 + \sum_{n=1}^{\infty}\left(\frac{-i}{\hbar}\right)^n \int_{t_0}^{t} dt_1 \int_{t_0}^{t_1} dt_2 \cdots \int_{t_0}^{t_{n-1}} dt_n\, H(t_1)H(t_2)\cdots H(t_n), \tag{2.1.33}$$

which is sometimes known as the Dyson series, after F. J. Dyson, who developed a perturbation expansion of this form in quantum field theory. We do not prove (2.1.33) now because the proof is very similar to the one presented in Chapter 5 for the time-evolution operator in the interaction picture.

In elementary applications, only case 1 is of practical interest. In the remaining 
part of this chapter we assume that the H operator is time-independent. We will 
encounter time-dependent Hamiltonians in Chapter 5. 



Energy Eigenkets 

To be able to evaluate the effect of the time-evolution operator (2.1.28) on a general initial ket $|\alpha\rangle$, we must first know how it acts on the base kets used in expanding $|\alpha\rangle$. This is particularly straightforward if the base kets used are eigenkets of $A$ such that

$$[A, H] = 0; \tag{2.1.34}$$

then the eigenkets of $A$ are also eigenkets of $H$, called energy eigenkets, whose eigenvalues are denoted by $E_{a'}$:

$$H|a'\rangle = E_{a'}|a'\rangle. \tag{2.1.35}$$

We can now expand the time-evolution operator in terms of $|a'\rangle\langle a'|$. Taking $t_0 = 0$ for simplicity, we obtain

$$\exp\left(\frac{-iHt}{\hbar}\right) = \sum_{a'}\sum_{a''} |a''\rangle\langle a''|\exp\left(\frac{-iHt}{\hbar}\right)|a'\rangle\langle a'| = \sum_{a'} |a'\rangle \exp\left(\frac{-iE_{a'}t}{\hbar}\right)\langle a'|. \tag{2.1.36}$$








The time-evolution operator written in this form enables us to solve any initial-value problem once the expansion of the initial ket in terms of $\{|a'\rangle\}$ is known. As an example, suppose that the initial ket expansion reads

$$|\alpha, t_0 = 0\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle = \sum_{a'} c_{a'}|a'\rangle. \tag{2.1.37}$$

We then have

$$|\alpha, t_0 = 0; t\rangle = \exp\left(\frac{-iHt}{\hbar}\right)|\alpha, t_0 = 0\rangle = \sum_{a'} |a'\rangle\langle a'|\alpha\rangle \exp\left(\frac{-iE_{a'}t}{\hbar}\right). \tag{2.1.38}$$

In other words, the expansion coefficient changes with time as

$$c_{a'}(t = 0) \to c_{a'}(t) = c_{a'}(t = 0)\exp\left(\frac{-iE_{a'}t}{\hbar}\right) \tag{2.1.39}$$

with its modulus unchanged. Notice that the relative phases among various components do vary with time because the oscillation frequencies are different.

A special case of interest is where the initial state happens to be one of $\{|a'\rangle\}$ itself. We have

$$|\alpha, t_0 = 0\rangle = |a'\rangle \tag{2.1.40}$$

initially, and at a later time

$$|\alpha, t_0 = 0; t\rangle = |a'\rangle \exp\left(\frac{-iE_{a'}t}{\hbar}\right), \tag{2.1.41}$$

so if the system is initially a simultaneous eigenstate of $A$ and $H$, it remains so at all times. The most that can happen is the phase modulation, $\exp(-iE_{a'}t/\hbar)$. It is in this sense that an observable compatible with $H$ [see (2.1.34)] is a constant of the motion. We will encounter this connection once again in a different form when we discuss the Heisenberg equation of motion.

In the foregoing discussion the basic task in quantum dynamics is reduced to finding an observable that commutes with $H$ and evaluating its eigenvalues. Once that is done, we expand the initial ket in terms of the eigenkets of that observable and just apply the time-evolution operator. This last step amounts merely to changing the phase of each expansion coefficient, as indicated by (2.1.39).
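The recipe just described (expand in energy eigenkets, then attach a phase to each coefficient) can be sketched in a few lines. This is an illustrative sketch only: it assumes natural units with $\hbar = 1$, and the two-level coefficients and energies are made-up sample values; `evolve_coefficients` is a hypothetical helper name.

```python
import cmath

HBAR = 1.0  # natural units; an assumption made purely for illustration


def evolve_coefficients(coeffs, energies, t):
    """Apply (2.1.39): c_a'(t) = c_a'(0) * exp(-i E_a' t / hbar) for each a'."""
    return [c * cmath.exp(-1j * e * t / HBAR) for c, e in zip(coeffs, energies)]


if __name__ == "__main__":
    c0 = [0.6, 0.8j]       # hypothetical normalized initial coefficients
    energies = [1.0, 2.5]  # hypothetical energy eigenvalues
    ct = evolve_coefficients(c0, energies, t=0.7)

    # The moduli are unchanged, so each probability |c_a'|^2 is constant in time.
    print([abs(c) for c in c0], [abs(c) for c in ct])

    # The relative phase between the two components does change with time.
    print(cmath.phase(c0[1] / c0[0]), cmath.phase(ct[1] / ct[0]))
```

The relative phase advances at the rate $(E_2 - E_1)/\hbar$, which is what produces observable time dependence in superposition states.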

Even though we worked out the case where there is just one observable $A$ that commutes with $H$, our considerations can easily be generalized when there are several mutually compatible observables all also commuting with $H$:

$$[A, B] = [B, C] = [A, C] = \cdots = 0,$$
$$[A, H] = [B, H] = [C, H] = \cdots = 0. \tag{2.1.42}$$

Using the collective index notation of Section 1.4 [see (1.4.37)], we have

$$\exp\left(\frac{-iHt}{\hbar}\right) = \sum_{K'} |K'\rangle \exp\left(\frac{-iE_{K'}t}{\hbar}\right)\langle K'|, \tag{2.1.43}$$

where $E_{K'}$ is uniquely specified once $a', b', c', \ldots$ are specified. It is therefore of fundamental importance to find a complete set of mutually compatible observables that also commute with $H$. Once such a set is found, we express the initial ket as a superposition of the simultaneous eigenkets of $A, B, C, \ldots$ and $H$. The final step is just to apply the time-evolution operator, written as (2.1.43). In this manner we can solve the most general initial-value problem with a time-independent $H$.



Time Dependence of Expectation Values 

It is instructive to study how the expectation value of an observable changes as a function of time. Suppose that at $t = 0$ the initial state is one of the eigenstates of an observable $A$ that commutes with $H$, as in (2.1.40). We now look at the expectation value of some other observable $B$, which need not commute with $A$ or with $H$. Because at a later time we have

$$|a', t_0 = 0; t\rangle = \mathcal{U}(t, 0)|a'\rangle \tag{2.1.44}$$

for the state ket, $\langle B\rangle$ is given by

$$\langle B\rangle = \left(\langle a'|\mathcal{U}^\dagger(t, 0)\right)\cdot B\cdot\left(\mathcal{U}(t, 0)|a'\rangle\right) = \langle a'|\exp\left(\frac{iE_{a'}t}{\hbar}\right) B \exp\left(\frac{-iE_{a'}t}{\hbar}\right)|a'\rangle = \langle a'|B|a'\rangle, \tag{2.1.45}$$

which is independent of $t$. So the expectation value of an observable taken with respect to an energy eigenstate does not change with time. For this reason an energy eigenstate is often referred to as a stationary state.

The situation is more interesting when the expectation value is taken with respect to a superposition of energy eigenstates, or a nonstationary state. Suppose that initially we have

$$|\alpha, t_0 = 0\rangle = \sum_{a'} c_{a'}|a'\rangle. \tag{2.1.46}$$

We easily compute the expectation value of $B$ to be

$$\langle B\rangle = \left[\sum_{a'} c_{a'}^*\langle a'|\exp\left(\frac{iE_{a'}t}{\hbar}\right)\right]\cdot B\cdot\left[\sum_{a''} c_{a''}\exp\left(\frac{-iE_{a''}t}{\hbar}\right)|a''\rangle\right] = \sum_{a'}\sum_{a''} c_{a'}^* c_{a''}\langle a'|B|a''\rangle \exp\left[\frac{-i(E_{a''} - E_{a'})t}{\hbar}\right]. \tag{2.1.47}$$

So this time the expectation value consists of oscillating terms whose angular frequencies are determined by N. Bohr’s frequency condition

$$\omega_{a''a'} = \frac{(E_{a''} - E_{a'})}{\hbar}. \tag{2.1.48}$$







Spin Precession 

It is appropriate to treat an example here. We consider an extremely simple system that, however, illustrates the basic formalism we have developed.

We start with a Hamiltonian of a spin $\tfrac{1}{2}$ system with magnetic moment $e\hbar/2m_e c$ subjected to an external magnetic field $\mathbf{B}$:

$$H = -\left(\frac{e}{m_e c}\right)\mathbf{S}\cdot\mathbf{B} \tag{2.1.49}$$

($e < 0$ for the electron). Furthermore, we take $\mathbf{B}$ to be a static, uniform magnetic field in the $z$-direction. We can then write $H$ as

$$H = -\left(\frac{eB}{m_e c}\right)S_z. \tag{2.1.50}$$

Because $S_z$ and $H$ differ just by a multiplicative constant, they obviously commute. The $S_z$ eigenstates are also energy eigenstates, and the corresponding energy eigenvalues are

$$E_\pm = \mp\frac{e\hbar B}{2m_e c}, \qquad \text{for } S_z\pm. \tag{2.1.51}$$

It is convenient to define $\omega$ in such a way that the difference in the two energy eigenvalues is $\hbar\omega$:

$$\omega \equiv \frac{|e|B}{m_e c}. \tag{2.1.52}$$

We can then rewrite the $H$ operator simply as

$$H = \omega S_z. \tag{2.1.53}$$



All the information on time development is contained in the time-evolution operator

$$\mathcal{U}(t, 0) = \exp\left(\frac{-i\omega S_z t}{\hbar}\right). \tag{2.1.54}$$

We apply this to the initial state. The base kets we must use in expanding the initial ket are obviously the $S_z$ eigenkets, $|+\rangle$ and $|-\rangle$, which are also energy eigenkets. Suppose that at $t = 0$ the system is characterized by

$$|\alpha\rangle = c_+|+\rangle + c_-|-\rangle. \tag{2.1.55}$$

Upon applying (2.1.54), we see that the state ket at some later time is

$$|\alpha, t_0 = 0; t\rangle = c_+\exp\left(\frac{-i\omega t}{2}\right)|+\rangle + c_-\exp\left(\frac{+i\omega t}{2}\right)|-\rangle, \tag{2.1.56}$$

where we have used

$$H|\pm\rangle = \left(\frac{\pm\hbar\omega}{2}\right)|\pm\rangle. \tag{2.1.57}$$



Specifically, let us suppose that the initial ket $|\alpha\rangle$ represents the spin-up (or, more precisely, $S_z+$) state $|+\rangle$, which means that

$$c_+ = 1, \qquad c_- = 0. \tag{2.1.58}$$

At a later time, (2.1.56) tells us that it is still in the spin-up state, which is no surprise because this is a stationary state.

Next, let us suppose that initially the system is in the $S_x+$ state. Comparing (1.4.17a) with (2.1.55), we see that

$$c_+ = c_- = \frac{1}{\sqrt{2}}. \tag{2.1.59}$$



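It is straightforward to work out the probabilities for the system to be found in the $S_x\pm$ state at some later time $t$:

$$|\langle S_x\pm|\alpha, t_0 = 0; t\rangle|^2 = \left|\left[\left(\frac{1}{\sqrt{2}}\right)\langle +| \pm \left(\frac{1}{\sqrt{2}}\right)\langle -|\right]\cdot\left[\left(\frac{1}{\sqrt{2}}\right)\exp\left(\frac{-i\omega t}{2}\right)|+\rangle + \left(\frac{1}{\sqrt{2}}\right)\exp\left(\frac{+i\omega t}{2}\right)|-\rangle\right]\right|^2$$

$$= \left|\frac{1}{2}\exp\left(\frac{-i\omega t}{2}\right) \pm \frac{1}{2}\exp\left(\frac{+i\omega t}{2}\right)\right|^2$$

$$= \cos^2\frac{\omega t}{2} \qquad \text{for } S_x+, \tag{2.1.60a}$$

and

$$= \sin^2\frac{\omega t}{2} \qquad \text{for } S_x-. \tag{2.1.60b}$$

Even though the spin is initially in the positive $x$-direction, the magnetic field in the $z$-direction causes it to rotate; as a result, we obtain a finite probability for finding $S_x-$ at some later time. The sum of the two probabilities is seen to be unity at all times, in agreement with the unitarity property of the time-evolution operator.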

Using (1.4.6), we can write the expectation value of $S_x$ as

$$\langle S_x\rangle = \left(\frac{\hbar}{2}\right)\cos^2\left(\frac{\omega t}{2}\right) + \left(\frac{-\hbar}{2}\right)\sin^2\left(\frac{\omega t}{2}\right) = \left(\frac{\hbar}{2}\right)\cos\omega t, \tag{2.1.61}$$

so this quantity oscillates with an angular frequency corresponding to the difference of the two energy eigenvalues divided by $\hbar$, in agreement with our general formula (2.1.47). Similar exercises with $S_y$ and $S_z$ show that

$$\langle S_y\rangle = \left(\frac{\hbar}{2}\right)\sin\omega t \tag{2.1.62a}$$

and

$$\langle S_z\rangle = 0. \tag{2.1.62b}$$

Physically this means that the spin precesses in the $xy$-plane. We will comment further on spin precession when we discuss rotation operators in Chapter 3.
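The precession results (2.1.60) through (2.1.62) can be reproduced directly from the coefficients in (2.1.56). The sketch below is illustrative only: it assumes natural units with $\hbar = 1$ and hypothetical sample values for $\omega$ and $t$, and the helper names are not from the text. It uses $|S_x;\pm\rangle = (|+\rangle \pm |-\rangle)/\sqrt{2}$ and the standard spin $\tfrac{1}{2}$ matrix elements of $S_x$ and $S_y$ in the $S_z$ basis.

```python
import cmath
import math

HBAR = 1.0  # natural units; an assumption made purely for illustration


def evolve_spin(c_plus, c_minus, omega, t):
    """State (2.1.56): the S_z components pick up phases exp(-/+ i omega t / 2)."""
    return (c_plus * cmath.exp(-0.5j * omega * t),
            c_minus * cmath.exp(+0.5j * omega * t))


def prob_sx(c_plus, c_minus, sign):
    """|<S_x; +/-|state>|^2, with sign = +1 or -1 selecting the S_x eigenstate."""
    amplitude = (c_plus + sign * c_minus) / math.sqrt(2)
    return abs(amplitude) ** 2


def expect_sx(c_plus, c_minus):
    """<S_x> = hbar * Re(c_+^* c_-) in the S_z basis."""
    return HBAR * (complex(c_plus).conjugate() * c_minus).real


def expect_sy(c_plus, c_minus):
    """<S_y> = hbar * Im(c_+^* c_-) in the S_z basis."""
    return HBAR * (complex(c_plus).conjugate() * c_minus).imag


if __name__ == "__main__":
    omega, t = 2.0, 0.9                # hypothetical sample values
    c0 = 1 / math.sqrt(2)              # S_x+ initial state, see (2.1.59)
    cp, cm = evolve_spin(c0, c0, omega, t)
    print(prob_sx(cp, cm, +1), math.cos(omega * t / 2) ** 2)  # compare (2.1.60a)
    print(prob_sx(cp, cm, -1), math.sin(omega * t / 2) ** 2)  # compare (2.1.60b)
    print(expect_sx(cp, cm), 0.5 * HBAR * math.cos(omega * t))  # compare (2.1.61)
    print(expect_sy(cp, cm), 0.5 * HBAR * math.sin(omega * t))  # compare (2.1.62a)
```

Each printed pair should agree to rounding error, and the two probabilities sum to unity at every $t$.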

Experimentally, spin precession is well established. In fact, it is used as a tool for other investigations of fundamental quantum-mechanical phenomena. For example, the form of the Hamiltonian (2.1.49) can be derived for point-like particles, such as electrons or muons, that obey the Dirac equation, for which the gyromagnetic ratio $g = 2$. (See Section 8.2.) However, higher-order corrections from quantum field theory predict a small but precisely calculable deviation from this, and it is a high priority to produce competitively precise measurements of $g - 2$.

Such an experiment has been recently completed. See G. W. Bennett et al., Phys. Rev. D 73 (2006) 072003. Muons are injected into a “storage ring” designed so that their spins will precess in lock step with their momentum vector only if $g = 2$. Consequently, observation of their precession measures $g - 2$ directly, facilitating a very precise result. Figure 2.1 shows the experimenters’ observation of the muon spin rotation over more than one hundred periods. They determine a value for $g - 2$ to a precision smaller than one part per million, which agrees reasonably well with the theoretical value.

FIGURE 2.1 Observations of the precession of muon spin by G. W. Bennett et al., Phys. Rev. D 73 (2006) 072003. Data points are wrapped around every 100 μs. The size of the signal decreases with time because the muons decay. (Horizontal axis: time modulo 100 μs.)

Neutrino Oscillations 

A lovely example of quantum-mechanical dynamics leading to interference in a two-state system, based on current physics research, is provided by the phenomenon known as neutrino oscillations.

Neutrinos are elementary particles with no charge and very small mass, much smaller than that of an electron. They are known to occur in nature in three distinct “flavors,” although for this discussion it suffices to consider only two of them. These two flavors are identified by their interactions, which may be either with electrons, in which case we write $\nu_e$, or with muons, that is $\nu_\mu$. These are in fact eigenstates of a Hamiltonian that controls those interactions.

On the other hand, it is possible (and, in fact, is now known to be true) that neutrinos have some other interactions, in which case their energy eigenvalues correspond to states that have a well-defined mass. These “mass eigenstates” would have eigenvalues $E_1$ and $E_2$, say, corresponding to masses $m_1$ and $m_2$, and might be denoted as $|\nu_1\rangle$ and $|\nu_2\rangle$. The “flavor eigenstates” are related to these through a simple unitary transformation, specified by some mixing angle $\theta$, as follows:

$$|\nu_e\rangle = \cos\theta\,|\nu_1\rangle - \sin\theta\,|\nu_2\rangle \tag{2.1.63a}$$

$$|\nu_\mu\rangle = \sin\theta\,|\nu_1\rangle + \cos\theta\,|\nu_2\rangle \tag{2.1.63b}$$

If the mixing angle were zero, then $|\nu_e\rangle$ and $|\nu_\mu\rangle$ would respectively be the same as $|\nu_1\rangle$ and $|\nu_2\rangle$. However, we know of no reason why this should be the case. Indeed, there is no strong theoretical bias for any particular value of $\theta$, and it is a free parameter that, today, can be determined only through experiment.

Neutrino oscillation is the phenomenon by which we can measure the mixing angle. Suppose we prepare, at time $t = 0$, a momentum eigenstate of one flavor of neutrino, say $|\nu_e\rangle$. Then according to (2.1.63a), the two different mass eigenstate components will evolve with different frequencies and therefore develop a relative phase difference. If the difference in the masses is small enough, then this phase difference can build up over a macroscopic distance. In fact, by measuring the interference as a function of distance, one can observe oscillations with a period that depends on the difference of masses, and an amplitude that depends on the mixing angle.

It is straightforward (see Problem 2.4 at the end of this chapter) to use (2.1.63) along with (2.1.28) and our quantum-mechanical postulates, and find a measurable quantity that exhibits neutrino oscillations. In this case, the Hamiltonian is just that for a free particle, but we need to take some care. Neutrinos have very low mass, so they are highly relativistic for any practical experimental conditions. Therefore, for a fixed momentum $p$, the energy eigenvalue for a neutrino of mass $m$ is given to an extremely good approximation as

$$E = \left(p^2c^2 + m^2c^4\right)^{1/2} \approx pc\left(1 + \frac{m^2c^2}{2p^2}\right). \tag{2.1.64}$$




If we next allow our state $|\nu_e\rangle$ to evolve, and then at some later time $t$ ask what is the probability that it still appears as a $|\nu_e\rangle$ (as opposed to a $|\nu_\mu\rangle$), we find

$$P(\nu_e \to \nu_e) = 1 - \sin^2 2\theta\,\sin^2\left(\Delta m^2 c^4\,\frac{L}{4E\hbar c}\right), \tag{2.1.65}$$

where $\Delta m^2 = m_1^2 - m_2^2$, $L = ct$ is the flight distance of the neutrino, and $E = pc$ is the nominal neutrino energy.

The oscillations predicted by (2.1.65) have been dramatically observed by the KamLAND experiment. See Figure 2.2. Neutrinos from a series of nuclear reactors are detected at a distance of ~150 km, and the rate is compared to that expected from reactor power and properties. The curve is not a perfect sine wave because the reactors are not all at the same distance from the detector.

FIGURE 2.2 Neutrino oscillations as observed by the KamLAND experiment, taken from S. Abe et al., Phys. Rev. Lett. 100 (2008) 221803. The oscillations as a function of $L/E$ demonstrate interference between different mass eigenstates of neutrinos.
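For numerical work, (2.1.65) is convenient to evaluate with $\Delta m^2 c^4$ in eV$^2$, $L$ in meters, and $E$ in eV, using $\hbar c \approx 1.973 \times 10^{-7}$ eV m. The following is a minimal sketch under those unit conventions; the function names and the sample parameter values are illustrative assumptions and are not measured results quoted from the text.

```python
import math

HBAR_C_EV_M = 1.973269804e-7  # hbar * c in eV * m


def oscillation_phase(delta_m2_ev2, length_m, energy_ev):
    """Argument of the second sine in (2.1.65): (Delta m^2 c^4) L / (4 E hbar c)."""
    return delta_m2_ev2 * length_m / (4.0 * energy_ev * HBAR_C_EV_M)


def survival_probability(theta, delta_m2_ev2, length_m, energy_ev):
    """P(nu_e -> nu_e) from (2.1.65) for mixing angle theta in radians."""
    phase = oscillation_phase(delta_m2_ev2, length_m, energy_ev)
    return 1.0 - math.sin(2.0 * theta) ** 2 * math.sin(phase) ** 2


if __name__ == "__main__":
    # With L = 1 km and E = 1 GeV the phase is about 1.267 per eV^2 of Delta m^2,
    # the numerical factor commonly quoted in the neutrino literature.
    print(oscillation_phase(1.0, 1.0e3, 1.0e9))

    # Hypothetical parameters, chosen only to display the L/E dependence.
    theta, dm2, energy = 0.6, 7.5e-5, 4.0e6
    for length_km in (50, 100, 150, 200):
        p = survival_probability(theta, dm2, length_km * 1.0e3, energy)
        print(f"L = {length_km:3d} km   P = {p:.3f}")
```

Because only the combination $L/E$ enters the phase, data taken at different energies and baselines can be plotted on a single curve, as in Figure 2.2.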

Correlation Amplitude and the Energy-Time Uncertainty Relation 

We conclude this section by asking how state kets at different times are correlated with each other. Suppose the initial state ket at $t = 0$ of a physical system is given by $|\alpha\rangle$. With time it changes into $|\alpha, t_0 = 0; t\rangle$, which we obtain by applying the time-evolution operator. We are concerned with the extent to which the state ket at a later time $t$ is similar to the state ket at $t = 0$; we therefore construct the inner product between the two state kets at different times:

$$C(t) \equiv \langle\alpha|\alpha, t_0 = 0; t\rangle = \langle\alpha|\mathcal{U}(t, 0)|\alpha\rangle, \tag{2.1.66}$$

which is known as the correlation amplitude. The modulus of $C(t)$ provides a quantitative measure of the “resemblance” between the state kets at different times.

As an extreme example, consider the very special case where the initial ket $|\alpha\rangle$ is an eigenket of $H$; we then have

$$C(t) = \langle a'|a', t_0 = 0; t\rangle = \exp\left(\frac{-iE_{a'}t}{\hbar}\right), \tag{2.1.67}$$

so the modulus of the correlation amplitude is unity at all times, which is not surprising for a stationary state. In the more general situation where the initial ket is represented by a superposition of $\{|a'\rangle\}$, as in (2.1.37), we have

$$C(t) = \left(\sum_{a'} c_{a'}^*\langle a'|\right)\left[\sum_{a''} c_{a''}\exp\left(\frac{-iE_{a''}t}{\hbar}\right)|a''\rangle\right] = \sum_{a'} |c_{a'}|^2 \exp\left(\frac{-iE_{a'}t}{\hbar}\right). \tag{2.1.68}$$

As we sum over many terms with oscillating time dependence of different frequencies, a strong cancellation is possible for moderately large values of $t$. We expect the correlation amplitude that starts with unity at $t = 0$ to decrease in magnitude with time.

To estimate (2.1.68) in a more concrete manner, let us suppose that the state ket can be regarded as a superposition of so many energy eigenkets with similar energies that we can regard them as exhibiting essentially a quasi-continuous spectrum. It is then legitimate to replace the sum by the integral

$$\sum_{a'} \to \int dE\,\rho(E), \qquad c_{a'} \to g(E)\Big|_{E \simeq E_{a'}}, \tag{2.1.69}$$

where $\rho(E)$ characterizes the density of energy eigenstates. Expression (2.1.68) now becomes

$$C(t) = \int dE\,|g(E)|^2\rho(E)\exp\left(\frac{-iEt}{\hbar}\right), \tag{2.1.70}$$

subject to the normalization condition

$$\int dE\,|g(E)|^2\rho(E) = 1. \tag{2.1.71}$$

In a realistic physical situation, $|g(E)|^2\rho(E)$ may be peaked around $E = E_0$ with width $\Delta E$. Writing (2.1.70) as

$$C(t) = \exp\left(\frac{-iE_0 t}{\hbar}\right)\int dE\,|g(E)|^2\rho(E)\exp\left[\frac{-i(E - E_0)t}{\hbar}\right], \tag{2.1.72}$$

we see that as $t$ becomes large, the integrand oscillates very rapidly unless the energy interval $|E - E_0|$ is small compared with $\hbar/t$. If the interval for which $|E - E_0| \simeq \hbar/t$ holds is much narrower than $\Delta E$ (the width of $|g(E)|^2\rho(E)$), we get essentially no contribution to $C(t)$ because of strong cancellations. The characteristic time at which the modulus of the correlation amplitude starts becoming appreciably different from 1 is given by

$$t \simeq \frac{\hbar}{\Delta E}. \tag{2.1.73}$$



Even though this equation is obtained for a superposition state with a quasi-continuous energy spectrum, it also makes sense for a two-level system; in the spin-precession problem considered earlier, the state ket, which is initially $|S_x+\rangle$, starts losing its identity after $\sim 1/\omega = \hbar/(E_+ - E_-)$, as is evident from (2.1.60).

To summarize, as a result of time evolution the state ket of a physical system ceases to retain its original form after a time interval of order $\hbar/\Delta E$. In the literature this point is often said to illustrate the time-energy uncertainty relation

$$\Delta t\,\Delta E \simeq \hbar. \tag{2.1.74}$$

However, this time-energy uncertainty relation is of a very different nature from the uncertainty relation between two incompatible observables discussed in Section 1.4. In Chapter 5 we will come back to (2.1.74) in connection with time-dependent perturbation theory.
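The estimate $t \simeq \hbar/\Delta E$ can be illustrated with a concrete energy distribution. The sketch below assumes, purely for illustration, that $|g(E)|^2\rho(E)$ is a normalized Gaussian centered at $E_0$ with standard deviation $\sigma$ (playing the role of $\Delta E$), works in natural units with $\hbar = 1$, and approximates (2.1.70) by a simple Riemann sum. For this particular choice the modulus is known in closed form, $|C(t)| = \exp(-\sigma^2 t^2/2\hbar^2)$, which the sum should reproduce. The function names and sample values are hypothetical.

```python
import cmath
import math

HBAR = 1.0  # natural units; an assumption made purely for illustration


def correlation_amplitude(t, e0, sigma, n_points=4001, span=8.0):
    """Riemann-sum estimate of (2.1.70) for a Gaussian |g(E)|^2 rho(E).

    The Gaussian shape is an illustrative assumption; e0 is the peak energy
    and sigma its standard deviation. Integration covers e0 +/- span * sigma.
    """
    lower = e0 - span * sigma
    d_e = 2.0 * span * sigma / (n_points - 1)
    norm = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    total = 0j
    for k in range(n_points):
        energy = lower + k * d_e
        weight = norm * math.exp(-0.5 * ((energy - e0) / sigma) ** 2)
        total += weight * cmath.exp(-1j * energy * t / HBAR) * d_e
    return total


def gaussian_modulus(t, sigma):
    """Closed-form |C(t)| for the Gaussian case: exp(-(sigma t)^2 / (2 hbar^2))."""
    return math.exp(-0.5 * (sigma * t / HBAR) ** 2)


if __name__ == "__main__":
    e0, sigma = 5.0, 0.5  # hypothetical sample values
    for t in (0.0, 1.0, 2.0, 4.0, 8.0):
        numeric = abs(correlation_amplitude(t, e0, sigma))
        print(f"t = {t:4.1f}   |C| = {numeric:.6f}   closed form = {gaussian_modulus(t, sigma):.6f}")
```

With these sample values $\hbar/\sigma = 2$, and the modulus has fallen to about 0.61 by $t = 2$ and is negligible a few multiples later, consistent with (2.1.73).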



2.2 ■ THE SCHRÖDINGER VERSUS THE HEISENBERG PICTURE

Unitary Operators

In the previous section we introduced the concept of time development by considering the time-evolution operator that affects state kets; this approach to quantum dynamics is known as the Schrödinger picture. There is another formulation of quantum dynamics where observables, rather than state kets, vary with time; this second approach is known as the Heisenberg picture. Before discussing the differences between the two approaches in detail, we digress to make some general comments on unitary operators.

Unitary operators are used for many different purposes in quantum mechanics. In this book we introduced (Section 1.5) an operator satisfying the unitarity property. In that section we were concerned with the question of how the base kets in one representation are related to those in some other representations. The state kets themselves are assumed not to change as we switch to a different set of base kets, even though the numerical values of the expansion coefficients for $|\alpha\rangle$ are, of course, different in different representations. Subsequently we introduced two unitary operators that actually change the state kets, the translation operator of Section 1.6 and the time-evolution operator of Section 2.1. We have

$$|\alpha\rangle \to U|\alpha\rangle, \tag{2.2.1}$$

where $U$ may stand for $\mathcal{T}(d\mathbf{x})$ or $\mathcal{U}(t, t_0)$. Here $U|\alpha\rangle$ is the state ket corresponding to a physical system that actually has undergone translation or time evolution.







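It is important to keep in mind that under a unitary transformation that changes the state kets, the inner product of a state bra and a state ket remains unchanged:

$$\langle\beta|\alpha\rangle \to \langle\beta|U^\dagger U|\alpha\rangle = \langle\beta|\alpha\rangle. \tag{2.2.2}$$

Using the fact that these transformations affect the state kets but not operators, we can infer how $\langle\beta|X|\alpha\rangle$ must change:

$$\langle\beta|X|\alpha\rangle \to \left(\langle\beta|U^\dagger\right)\cdot X\cdot\left(U|\alpha\rangle\right) = \langle\beta|U^\dagger X U|\alpha\rangle. \tag{2.2.3}$$

We now make a very simple mathematical observation that follows from the associative axiom of multiplication:

$$\left(\langle\beta|U^\dagger\right)\cdot X\cdot\left(U|\alpha\rangle\right) = \langle\beta|\cdot\left(U^\dagger X U\right)\cdot|\alpha\rangle. \tag{2.2.4}$$

Is there any physics in this observation? This mathematical identity suggests two approaches to unitary transformations:

Approach 1:

$$|\alpha\rangle \to U|\alpha\rangle, \text{ with operators unchanged.} \tag{2.2.5a}$$

Approach 2:

$$X \to U^\dagger X U, \text{ with state kets unchanged.} \tag{2.2.5b}$$

In classical physics we do not introduce state kets, yet we talk about translation, time evolution, and the like. This is possible because these operations actually change quantities such as $\mathbf{x}$ and $\mathbf{L}$, which are observables of classical mechanics. We therefore conjecture that a closer connection with classical physics may be established if we follow approach 2.

A simple example may be helpful here. We go back to the infinitesimal translation operator $\mathcal{T}(d\mathbf{x}')$. The formalism presented in Section 1.6 is based on approach 1; $\mathcal{T}(d\mathbf{x}')$ affects the state kets, not the position operator:

$$|\alpha\rangle \to \left(1 - \frac{i\mathbf{p}\cdot d\mathbf{x}'}{\hbar}\right)|\alpha\rangle, \qquad \mathbf{x} \to \mathbf{x}. \tag{2.2.6}$$

In contrast, if we follow approach 2, we obtain

$$|\alpha\rangle \to |\alpha\rangle, \qquad \mathbf{x} \to \left(1 + \frac{i\mathbf{p}\cdot d\mathbf{x}'}{\hbar}\right)\mathbf{x}\left(1 - \frac{i\mathbf{p}\cdot d\mathbf{x}'}{\hbar}\right) = \mathbf{x} + \left(\frac{i}{\hbar}\right)[\mathbf{p}\cdot d\mathbf{x}', \mathbf{x}] = \mathbf{x} + d\mathbf{x}'. \tag{2.2.7}$$

We leave it as an exercise for the reader to show that both approaches lead to the same result for the expectation value of $\mathbf{x}$:

$$\langle\mathbf{x}\rangle \to \langle\mathbf{x}\rangle + \langle d\mathbf{x}'\rangle. \tag{2.2.8}$$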




Chapter 2 Quantum Dynamics 

State Kets and Observables in the Schrodinger and the Heisenberg Pictures 

We now return to the time-evolution operator $\mathcal{U}(t, t_0)$. In the previous section
we examined how state kets evolve with time. This means that we were following
approach 1, which is known as the Schrodinger picture when it is applied to time
evolution. Alternatively, we may follow approach 2, known as the Heisenberg
picture when applied to time evolution.

In the Schrodinger picture the operators corresponding to observables such as
$x$, $p_y$, and $S_z$ are fixed in time, while state kets vary with time, as indicated in the
previous section. In contrast, in the Heisenberg picture the operators corresponding
to observables vary with time; the state kets are fixed (frozen, so to speak) at
what they were at $t_0$. It is convenient to set $t_0$ in $\mathcal{U}(t, t_0)$ to zero for simplicity and
work with $\mathcal{U}(t)$, which is defined by

$$\mathcal{U}(t, t_0 = 0) \equiv \mathcal{U}(t) = \exp\left(\frac{-iHt}{\hbar}\right). \tag{2.2.9}$$

Motivated by (2.2.5b) of approach 2, we define the Heisenberg picture observable
by

$$A^{(H)}(t) \equiv \mathcal{U}^\dagger(t)\, A^{(S)}\, \mathcal{U}(t), \tag{2.2.10}$$

where the superscripts $H$ and $S$ stand for Heisenberg and Schrodinger,
respectively. At $t = 0$, the Heisenberg picture observable and the corresponding
Schrodinger picture observable coincide:

$$A^{(H)}(0) = A^{(S)}. \tag{2.2.11}$$

The state kets also coincide between the two pictures at $t = 0$; at later $t$ the
Heisenberg-picture state ket is frozen at what it was at $t = 0$:

$$|\alpha, t_0 = 0; t\rangle_H = |\alpha, t_0 = 0\rangle, \tag{2.2.12}$$

independent of $t$. This is in dramatic contrast with the Schrodinger-picture state
ket,

$$|\alpha, t_0 = 0; t\rangle_S = \mathcal{U}(t)|\alpha, t_0 = 0\rangle. \tag{2.2.13}$$

The expectation value $\langle A\rangle$ is obviously the same in both pictures:

$${}_S\langle\alpha, t_0 = 0; t|A^{(S)}|\alpha, t_0 = 0; t\rangle_S = \langle\alpha, t_0 = 0|\mathcal{U}^\dagger A^{(S)} \mathcal{U}|\alpha, t_0 = 0\rangle = {}_H\langle\alpha, t_0 = 0; t|A^{(H)}(t)|\alpha, t_0 = 0; t\rangle_H. \tag{2.2.14}$$
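The equality (2.2.14) can be checked numerically on a small example. The sketch below is only illustrative: the spin-1/2 Hamiltonian $H = \omega S_z$, the choice $\hbar = 1$, and all function names are our own assumptions rather than part of the text. It computes $\langle S_x\rangle$ once by moving the state ket and once by moving the operator.

```python
import numpy as np

HBAR = 1.0  # assumption: units with hbar = 1


def evolution_operator(H, t, hbar=HBAR):
    """Return U(t) = exp(-i H t / hbar) for a Hermitian matrix H."""
    energies, vecs = np.linalg.eigh(H)
    phases = np.exp(-1j * energies * t / hbar)
    return (vecs * phases) @ vecs.conj().T


def expectation_schrodinger(A, H, ket0, t):
    """Move the state ket as in (2.2.13); keep the operator fixed."""
    ket_t = evolution_operator(H, t) @ ket0
    return np.vdot(ket_t, A @ ket_t).real


def expectation_heisenberg(A, H, ket0, t):
    """Move the operator as in (2.2.10); keep the state ket fixed."""
    U = evolution_operator(H, t)
    A_H = U.conj().T @ A @ U
    return np.vdot(ket0, A_H @ ket0).real


# Hypothetical example: spin 1/2 with H = omega * S_z, observable S_x.
omega = 2.0
S_z = 0.5 * HBAR * np.array([[1, 0], [0, -1]], dtype=complex)
S_x = 0.5 * HBAR * np.array([[0, 1], [1, 0]], dtype=complex)
H = omega * S_z
ket0 = np.array([1, 1], dtype=complex) / np.sqrt(2)  # S_x eigenket at t = 0

for t in (0.0, 0.3, 1.7):
    print(t,
          expectation_schrodinger(S_x, H, ket0, t),
          expectation_heisenberg(S_x, H, ket0, t))
```

For this particular choice both columns should agree with $(\hbar/2)\cos\omega t$, the familiar spin-precession result.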

The Heisenberg Equation of Motion 

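We now derive the fundamental equation of motion in the Heisenberg picture.
Assuming that $A^{(S)}$ does not depend explicitly on time, which is the case in most
physical situations of interest, we obtain [by differentiating (2.2.10)]

$$\frac{dA^{(H)}}{dt} = \frac{\partial\mathcal{U}^\dagger}{\partial t} A^{(S)} \mathcal{U} + \mathcal{U}^\dagger A^{(S)} \frac{\partial\mathcal{U}}{\partial t}$$

$$= -\frac{1}{i\hbar}\mathcal{U}^\dagger H \mathcal{U}\mathcal{U}^\dagger A^{(S)} \mathcal{U} + \frac{1}{i\hbar}\mathcal{U}^\dagger A^{(S)} \mathcal{U}\mathcal{U}^\dagger H \mathcal{U}$$

$$= \frac{1}{i\hbar}[A^{(H)}, \mathcal{U}^\dagger H \mathcal{U}], \tag{2.2.15}$$

where we have used [see (2.1.25)]

$$\frac{\partial\mathcal{U}}{\partial t} = \frac{1}{i\hbar} H \mathcal{U}, \tag{2.2.16a}$$

$$\frac{\partial\mathcal{U}^\dagger}{\partial t} = -\frac{1}{i\hbar} \mathcal{U}^\dagger H. \tag{2.2.16b}$$

Because $H$ was originally introduced in the Schrodinger picture, we may be
tempted to define

$$H^{(H)} = \mathcal{U}^\dagger H \mathcal{U} \tag{2.2.17}$$

in accordance with (2.2.10). But in elementary applications where $\mathcal{U}$ is given by
(2.2.9), $\mathcal{U}$ and $H$ obviously commute; as a result,

$$\mathcal{U}^\dagger H \mathcal{U} = H, \tag{2.2.18}$$

so it is all right to write (2.2.15) as

$$\frac{dA^{(H)}}{dt} = \frac{1}{i\hbar}[A^{(H)}, H]. \tag{2.2.19}$$

This equation is known as the Heisenberg equation of motion. Notice that we
have derived it using the properties of the time-evolution operator and the defining
equation for $A^{(H)}$.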
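As a rough numerical sanity check of (2.2.19), we can compare a finite-difference time derivative of $A^{(H)}(t)$ with the commutator on the right-hand side. The two-level Hamiltonian and observable below are arbitrary illustrative choices (assumptions, with $\hbar = 1$), and the helper names are ours.

```python
import numpy as np

HBAR = 1.0  # assumption: units with hbar = 1


def evolution_operator(H, t, hbar=HBAR):
    """Return U(t) = exp(-i H t / hbar) for a Hermitian matrix H."""
    energies, vecs = np.linalg.eigh(H)
    return (vecs * np.exp(-1j * energies * t / hbar)) @ vecs.conj().T


def heisenberg_operator(A, H, t):
    """A^(H)(t) = U^dagger(t) A^(S) U(t), as in (2.2.10)."""
    U = evolution_operator(H, t)
    return U.conj().T @ A @ U


def commutator(A, B):
    return A @ B - B @ A


# Hypothetical two-level system; the numbers carry no physical meaning.
H = np.array([[1.0, 0.4], [0.4, -0.5]], dtype=complex)
A = np.array([[0.0, 1.0], [1.0, 0.0]], dtype=complex)

t, dt = 0.8, 1e-5
# Left-hand side of (2.2.19): central finite difference of A^(H)(t).
lhs = (heisenberg_operator(A, H, t + dt)
       - heisenberg_operator(A, H, t - dt)) / (2 * dt)
# Right-hand side of (2.2.19): commutator with H divided by i*hbar.
rhs = commutator(heisenberg_operator(A, H, t), H) / (1j * HBAR)

print(np.max(np.abs(lhs - rhs)))  # small, limited by the finite-difference step
```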

It is instructive to compare (2.2.19) with the classical equation of motion in 
Poisson bracket form. In classical physics, for a function A of q’s and p’s that 
does not involve time explicitly, we have (Goldstein 2002, pp. 396-97) 

$$\frac{dA}{dt} = [A, H]_{\text{classical}}. \tag{2.2.20}$$

Again, we see that Dirac’s quantization rule (1.6.47) leads to the correct equation 
in quantum mechanics. Indeed, historically (2.2.19) was first written by P. A. M. 
Dirac, who — with his characteristic modesty — called it the Heisenberg equation 
of motion. It is worth noting, however, that (2.2.19) makes sense whether or not
$A^{(H)}$ has a classical analogue. For example, the spin operator in the Heisenberg
picture satisfies

$$\frac{dS_i^{(H)}}{dt} = \frac{1}{i\hbar}[S_i^{(H)}, H], \tag{2.2.21}$$

which can be used to discuss spin precession, but this equation has no classical
counterpart because $S_z$ cannot be written as a function of $q$'s and $p$'s. Rather than
insisting on Dirac's rule, (1.6.47), we may argue that for quantities possessing
classical counterparts, the correct classical equation can be obtained from the
corresponding quantum-mechanical equation via the ansatz,

$$\frac{[\ ,\ ]}{i\hbar} \to [\ ,\ ]_{\text{classical}}. \tag{2.2.22}$$

Classical mechanics can be derived from quantum mechanics, but the opposite is
not true.*



Free Particles; Ehrenfest's Theorem

Whether we work in the Schrodinger picture or in the Heisenberg picture, to be
able to use the equations of motion we must first learn how to construct the
appropriate Hamiltonian operator. For a physical system with classical analogues, we
assume the Hamiltonian to be of the same form as in classical physics; we merely
replace the classical $x_i$'s and $p_i$'s by the corresponding operators in quantum
mechanics. With this assumption we can reproduce the correct classical equations in
the classical limit. Whenever an ambiguity arises because of noncommuting
observables, we attempt to resolve it by requiring $H$ to be Hermitian; for instance, we
write the quantum-mechanical analogue of the classical product $xp$ as $\frac{1}{2}(xp + px)$.
When the physical system in question has no classical analogues, we can only
guess the structure of the Hamiltonian operator. We try various forms until we get
the Hamiltonian that leads to results agreeing with empirical observation.

In practical applications it is often necessary to evaluate the commutator of $x_i$
(or $p_i$) with functions of $x_j$ and $p_j$. To this end the following formulas are useful:

$$[x_i, F(\mathbf{p})] = i\hbar\frac{\partial F}{\partial p_i} \tag{2.2.23a}$$

and

$$[p_i, G(\mathbf{x})] = -i\hbar\frac{\partial G}{\partial x_i}, \tag{2.2.23b}$$

where $F$ and $G$ are functions that can be expanded in powers of $p_j$'s and $x_j$'s,
respectively. We can easily prove both formulas by repeatedly applying (1.6.50e).
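As a quick illustration of how (2.2.23a) arises, consider the simplest nontrivial case $F = p_j^2$ (our own choice of example, with no sum over $j$). Using the product rule for commutators and $[x_i, p_j] = i\hbar\delta_{ij}$, we find agreement with the derivative formula:

```latex
\begin{aligned}
[x_i, p_j^2] &= p_j\,[x_i, p_j] + [x_i, p_j]\,p_j \\
             &= 2i\hbar\,\delta_{ij}\,p_j \\
             &= i\hbar\,\frac{\partial (p_j^2)}{\partial p_i}.
\end{aligned}
```

Higher powers follow in the same way by induction, and a power series in the $p_j$'s then follows term by term.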

We are now in a position to apply the Heisenberg equation of motion to a free
particle of mass $m$. The Hamiltonian is taken to be of the same form as in classical
mechanics:

$$H = \frac{\mathbf{p}^2}{2m} = \frac{p_x^2 + p_y^2 + p_z^2}{2m}. \tag{2.2.24}$$



*In this book we follow the following order: the Schrodinger picture → the Heisenberg picture
→ classical. For an enlightening treatment of the same subject in the opposite order, classical →
the Heisenberg picture → the Schrodinger picture, see Finkelstein (1973), pp. 68-70 and 109.







We look at the observables $p_i$ and $x_i$, which are understood to be the momentum
and the position operator in the Heisenberg picture even though we omit the
superscript $(H)$. Because $p_i$ commutes with any function of $p_j$'s, we have

$$\frac{dp_i}{dt} = \frac{1}{i\hbar}[p_i, H] = 0. \tag{2.2.25}$$

Thus for a free particle, the momentum operator is a constant of the motion, which
means that $p_i(t)$ is the same as $p_i(0)$ at all times. Quite generally, it is evident from
the Heisenberg equation of motion (2.2.19) that whenever $A^{(H)}$ commutes with
the Hamiltonian, $A^{(H)}$ is a constant of the motion. Next,

$$\frac{dx_i}{dt} = \frac{1}{i\hbar}[x_i, H] = \frac{1}{i\hbar}\frac{1}{2m}\, i\hbar\,\frac{\partial}{\partial p_i}\left(\sum_{j=1}^{3} p_j^2\right) = \frac{p_i}{m} = \frac{p_i(0)}{m}, \tag{2.2.26}$$

where we have taken advantage of (2.2.23a), so we have the solution

$$x_i(t) = x_i(0) + \left(\frac{p_i(0)}{m}\right)t, \tag{2.2.27}$$

which is reminiscent of the classical trajectory equation for a uniform rectilinear
motion. It is important to note that even though we have

$$[x_i(0), x_j(0)] = 0 \tag{2.2.28}$$

at equal times, the commutator of the $x_i$'s at different times does not vanish;
specifically,

$$[x_i(t), x_i(0)] = \left[\frac{p_i(0)t}{m}, x_i(0)\right] = \frac{-i\hbar t}{m}. \tag{2.2.29}$$

Applying the uncertainty relation (1.4.53) to this commutator, we obtain

$$\langle(\Delta x_i)^2\rangle_t\, \langle(\Delta x_i)^2\rangle_{t=0} \ge \frac{\hbar^2 t^2}{4m^2}. \tag{2.2.30}$$

Among other things, this relation implies that even if the particle is well localized
at $t = 0$, its position becomes more and more uncertain with time, a conclusion
that can also be obtained by studying the time-evolution behavior of free-particle
wave packets in wave mechanics.
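To see (2.2.30) at work, it may help to look at one concrete case. The sketch below assumes the standard wave-mechanics result for a free Gaussian wave packet that starts with minimum uncertainty, $\langle(\Delta x)^2\rangle_t = \sigma_0^2 + (\hbar t/2m\sigma_0)^2$; this formula is not derived in this section and is taken here as an assumption, as are the function names and the units $\hbar = m = 1$.

```python
def gaussian_variance(sigma0_sq, t, m=1.0, hbar=1.0):
    """Position variance at time t of a free Gaussian packet (assumed formula)."""
    return sigma0_sq + (hbar * t) ** 2 / (4.0 * m ** 2 * sigma0_sq)


def lower_bound(t, m=1.0, hbar=1.0):
    """Right-hand side of (2.2.30)."""
    return (hbar * t) ** 2 / (4.0 * m ** 2)


# The product of variances at t and at 0 never drops below the bound.
for sigma0_sq in (0.01, 1.0, 25.0):
    for t in (0.5, 2.0, 10.0):
        product = gaussian_variance(sigma0_sq, t) * sigma0_sq
        print(sigma0_sq, t, product, lower_bound(t))
```

For this packet the product equals $\sigma_0^4 + \hbar^2 t^2/4m^2$, so the bound is approached only when the initial packet is very narrow, which is exactly the case in which it spreads fastest.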

We now add a potential $V(\mathbf{x})$ to our earlier free-particle Hamiltonian:

$$H = \frac{\mathbf{p}^2}{2m} + V(\mathbf{x}). \tag{2.2.31}$$

Here $V(\mathbf{x})$ is to be understood as a function of the $x$-, $y$-, and $z$-operators. Using
(2.2.23b) this time, we obtain

$$\frac{dp_i}{dt} = \frac{1}{i\hbar}[p_i, V(\mathbf{x})] = -\frac{\partial}{\partial x_i}V(\mathbf{x}). \tag{2.2.32}$$

On the other hand, we see that

$$\frac{dx_i}{dt} = \frac{p_i}{m} \tag{2.2.33}$$

still holds because $x_i$ commutes with the newly added term $V(\mathbf{x})$. We can use the
Heisenberg equation of motion once again to deduce

$$\frac{d^2x_i}{dt^2} = \frac{1}{i\hbar}\left[\frac{dx_i}{dt}, H\right] = \frac{1}{i\hbar}\left[\frac{p_i}{m}, H\right] = \frac{1}{m}\frac{dp_i}{dt}. \tag{2.2.34}$$

Combining this with (2.2.32), we finally obtain in vectorial form

$$m\frac{d^2\mathbf{x}}{dt^2} = -\nabla V(\mathbf{x}). \tag{2.2.35}$$

This is the quantum-mechanical analogue of Newton's second law. By taking the
expectation values of both sides with respect to a Heisenberg state ket that does
not move with time, we obtain

$$m\frac{d^2}{dt^2}\langle\mathbf{x}\rangle = \frac{d\langle\mathbf{p}\rangle}{dt} = -\langle\nabla V(\mathbf{x})\rangle. \tag{2.2.36}$$

This is known as the Ehrenfest theorem after P. Ehrenfest, who derived it in
1927 using the formalism of wave mechanics. When the theorem is written in
this expectation form, its validity is independent of whether we are using the
Heisenberg or the Schrodinger picture; after all, the expectation values are the
same in the two pictures. In contrast, the operator form (2.2.35) is meaningful
only if we understand $\mathbf{x}$ and $\mathbf{p}$ to be Heisenberg-picture operators.

We note that in (2.2.36) the $\hbar$'s have completely disappeared. It is therefore
not surprising that the center of a wave packet moves like a classical particle
subjected to $V(\mathbf{x})$.



Base Kets and Transition Amplitudes 

So far we have avoided asking how the base kets evolve with time. A common 
misconception is that as time goes on, all kets move in the Schrodinger picture 
and are stationary in the Heisenberg picture. This is not the case, as we will make 
clear shortly. The important point is to distinguish the behavior of state kets from
that of base kets. 







We started our discussion of ket spaces in Section 1.2 by remarking that the
eigenkets of observables are to be used as base kets. What happens to the defining
eigenvalue equation

$$A|a'\rangle = a'|a'\rangle \tag{2.2.37}$$

with time? In the Schrodinger picture, $A$ does not change, so the base kets,
obtained as the solutions to this eigenvalue equation at $t = 0$, for instance, must
remain unchanged. Unlike state kets, the base kets do not change in the Schrodinger
picture.

The whole situation is very different in the Heisenberg picture, where the
eigenvalue equation we must study is for the time-dependent operator

$$A^{(H)}(t) = \mathcal{U}^\dagger A(0)\, \mathcal{U}. \tag{2.2.38}$$

From (2.2.37) evaluated at $t = 0$, when the two pictures coincide, we deduce

$$\mathcal{U}^\dagger A(0)\, \mathcal{U}\mathcal{U}^\dagger|a'\rangle = a'\,\mathcal{U}^\dagger|a'\rangle, \tag{2.2.39}$$

which implies an eigenvalue equation for $A^{(H)}$:

$$A^{(H)}(\mathcal{U}^\dagger|a'\rangle) = a'(\mathcal{U}^\dagger|a'\rangle). \tag{2.2.40}$$

If we continue to maintain the view that the eigenkets of observables form the
base kets, then $\{\mathcal{U}^\dagger|a'\rangle\}$ must be used as the base kets in the Heisenberg picture.
As time goes on, the Heisenberg-picture base kets, denoted by $|a', t\rangle_H$, move as
follows:

$$|a', t\rangle_H = \mathcal{U}^\dagger|a'\rangle. \tag{2.2.41}$$

Because of the appearance of $\mathcal{U}^\dagger$ rather than $\mathcal{U}$ in (2.2.41), the Heisenberg-picture
base kets are seen to rotate oppositely when compared with the Schrodinger-picture
state kets; specifically, $|a', t\rangle_H$ satisfies the "wrong-sign Schrodinger equation"

$$i\hbar\frac{\partial}{\partial t}|a', t\rangle_H = -H|a', t\rangle_H. \tag{2.2.42}$$

As for the eigenvalues themselves, we see from (2.2.40) that they are unchanged
with time. This is consistent with the theorem on unitary equivalent observables
discussed in Section 1.5. Notice also the following expansion for $A^{(H)}(t)$
in terms of the base kets and bras of the Heisenberg picture:

$$A^{(H)}(t) = \sum_{a'} |a', t\rangle_H\, a'\, {}_H\langle a', t| = \sum_{a'} \mathcal{U}^\dagger|a'\rangle a'\langle a'|\mathcal{U} = \mathcal{U}^\dagger A^{(S)} \mathcal{U}, \tag{2.2.43}$$

which shows that everything is quite consistent, provided that the Heisenberg base
kets change as in (2.2.41).
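The statements (2.2.40) and (2.2.43) are easy to test numerically. In the sketch below, the two-level Hamiltonian, the observable, and the time are arbitrary illustrative assumptions (with $\hbar = 1$); the code checks that the kets $\mathcal{U}^\dagger|a'\rangle$ are eigenkets of $A^{(H)}(t)$ with unchanged eigenvalues and that the spectral sum reproduces $A^{(H)}(t)$.

```python
import numpy as np


def evolution_operator(H, t, hbar=1.0):
    """Return U(t) = exp(-i H t / hbar) for a Hermitian matrix H."""
    energies, vecs = np.linalg.eigh(H)
    return (vecs * np.exp(-1j * energies * t / hbar)) @ vecs.conj().T


# Hypothetical two-level system; the numbers carry no physical meaning.
H = np.array([[0.7, 0.2 - 0.1j], [0.2 + 0.1j, -0.3]])
A = np.array([[0.0, 1.0], [1.0, 0.0]], dtype=complex)
t = 1.3

U = evolution_operator(H, t)
eigvals, base_S = np.linalg.eigh(A)   # columns: Schrodinger-picture base kets |a'>
A_H = U.conj().T @ A @ U              # Heisenberg-picture observable, (2.2.38)
base_H = U.conj().T @ base_S          # columns: U^dagger |a'>, as in (2.2.41)

# (2.2.40): A^(H) (U^dagger |a'>) = a' (U^dagger |a'>), column by column.
residual = A_H @ base_H - base_H * eigvals
# (2.2.43): sum over a' of |a',t>_H a' _H<a',t| rebuilds A^(H)(t).
A_H_spectral = (base_H * eigvals) @ base_H.conj().T

print(np.max(np.abs(residual)), np.max(np.abs(A_H_spectral - A_H)))
```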

We see that the expansion coefficients of a state ket in terms of base kets are
the same in both pictures:

$$c_{a'}(t) = \underbrace{\langle a'|}_{\text{base bra}} \cdot \underbrace{(\mathcal{U}|\alpha, t_0 = 0\rangle)}_{\text{state ket}} \quad \text{(the Schrodinger picture)} \tag{2.2.44a}$$

$$c_{a'}(t) = \underbrace{(\langle a'|\mathcal{U})}_{\text{base bra}} \cdot \underbrace{|\alpha, t_0 = 0\rangle}_{\text{state ket}} \quad \text{(the Heisenberg picture).} \tag{2.2.44b}$$

Pictorially, we may say that the cosine of the angle between the state ket and the
base ket is the same whether we rotate the state ket counterclockwise or the base
ket clockwise. These considerations apply equally well to base kets that exhibit a
continuous spectrum; in particular, the wave function $\langle\mathbf{x}'|\alpha\rangle$ can be regarded either
as (1) the inner product of the stationary position eigenbra with the moving state
ket (the Schrodinger picture) or as (2) the inner product of the moving position
eigenbra with the stationary state ket (the Heisenberg picture). We will discuss
the time dependence of the wave function in Section 2.4, where we will derive the
celebrated wave equation of Schrodinger.

To illustrate further the equivalence between the two pictures, we study transition
amplitudes, which will play a fundamental role in Section 2.6. Suppose there
is a physical system prepared at $t = 0$ to be in an eigenstate of observable $A$ with
eigenvalue $a'$. At some later time $t$ we may ask, What is the probability amplitude,
known as the transition amplitude, for the system to be found in an eigenstate
of observable $B$ with eigenvalue $b'$? Here $A$ and $B$ can be the same or different. In
the Schrodinger picture the state ket at $t$ is given by $\mathcal{U}|a'\rangle$, whereas the base kets
$|a'\rangle$ and $|b'\rangle$ do not vary with time; so we have

$$\underbrace{\langle b'|}_{\text{base bra}} \cdot \underbrace{(\mathcal{U}|a'\rangle)}_{\text{state ket}} \tag{2.2.45}$$

for this transition amplitude. In contrast, in the Heisenberg picture the state ket
is stationary (that is, it remains as $|a'\rangle$ at all times), but the base kets evolve
oppositely. So the transition amplitude is

$$\underbrace{(\langle b'|\mathcal{U})}_{\text{base bra}} \cdot \underbrace{|a'\rangle}_{\text{state ket}}. \tag{2.2.46}$$

Obviously (2.2.45) and (2.2.46) are the same. They can both be written as

$$\langle b'|\mathcal{U}(t, 0)|a'\rangle. \tag{2.2.47}$$

In some loose sense, this is the transition amplitude for "going" from state $|a'\rangle$ to
state $|b'\rangle$.

To conclude this section, let us summarize the differences between the
Schrodinger picture and the Heisenberg picture. Table 2.1 provides such a
summary.




TABLE 2.1 The Schrodinger Picture versus the Heisenberg Picture

              Schrodinger picture          Heisenberg picture
State ket     Moving: (2.1.5), (2.1.27)    Stationary
Observable    Stationary                   Moving: (2.2.10), (2.2.19)
Base ket      Stationary                   Moving oppositely: (2.2.41), (2.2.42)



2.3 ■ SIMPLE HARMONIC OSCILLATOR 



The simple harmonic oscillator is one of the most important problems in quantum 
mechanics. It not only illustrates many of the basic concepts and methods of quan- 
tum mechanics but also has much practical value. Essentially any potential well 
can be approximated by a simple harmonic oscillator, so it describes phenomena 
from molecular vibrations to nuclear structure. Moreover, because the Hamiltonian
is basically the sum of squares of two canonically conjugate variables, it is
also an important starting point for much of quantum field theory.



Energy Eigenkets and Energy Eigenvalues 

We begin our discussion with Dirac's elegant operator method, which is based on
the earlier work of M. Born and N. Wiener, to obtain the energy eigenkets and
energy eigenvalues of the simple harmonic oscillator. The basic Hamiltonian is

$$H = \frac{p^2}{2m} + \frac{m\omega^2 x^2}{2}, \tag{2.3.1}$$

where $\omega$ is the angular frequency of the classical oscillator related to the spring
constant $k$ in Hooke's law via $\omega = \sqrt{k/m}$. The operators $x$ and $p$ are, of course,
Hermitian. It is convenient to define two non-Hermitian operators,

$$a = \sqrt{\frac{m\omega}{2\hbar}}\left(x + \frac{ip}{m\omega}\right), \qquad a^\dagger = \sqrt{\frac{m\omega}{2\hbar}}\left(x - \frac{ip}{m\omega}\right), \tag{2.3.2}$$

which are known as the annihilation operator and the creation operator,
respectively, for reasons that will become evident shortly. Using the canonical
commutation relations, we readily obtain

$$[a, a^\dagger] = \left(\frac{1}{2\hbar}\right)(-i[x, p] + i[p, x]) = 1. \tag{2.3.3}$$

We also define the number operator

$$N = a^\dagger a, \tag{2.3.4}$$

which is obviously Hermitian. It is straightforward to show that

$$a^\dagger a = \left(\frac{m\omega}{2\hbar}\right)\left(x^2 + \frac{p^2}{m^2\omega^2}\right) + \left(\frac{i}{2\hbar}\right)[x, p] = \frac{H}{\hbar\omega} - \frac{1}{2}, \tag{2.3.5}$$









so we have an important relation between the number operator and the Hamiltonian
operator:

$$H = \hbar\omega\left(N + \tfrac{1}{2}\right). \tag{2.3.6}$$

Because $H$ is just a linear function of $N$, $N$ can be diagonalized simultaneously
with $H$. We denote an energy eigenket of $N$ by its eigenvalue $n$, so

$$N|n\rangle = n|n\rangle. \tag{2.3.7}$$

We will later show that $n$ must be a nonnegative integer. Because of (2.3.6) we
also have

$$H|n\rangle = \left(n + \tfrac{1}{2}\right)\hbar\omega|n\rangle, \tag{2.3.8}$$

which means that the energy eigenvalues are given by

$$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega. \tag{2.3.9}$$

To appreciate the physical significance of $a$, $a^\dagger$, and $N$, let us first note that

$$[N, a] = [a^\dagger a, a] = a^\dagger[a, a] + [a^\dagger, a]a = -a, \tag{2.3.10}$$

where we have used (2.3.3). Likewise, we can derive

$$[N, a^\dagger] = a^\dagger. \tag{2.3.11}$$

As a result, we have

$$N a^\dagger|n\rangle = ([N, a^\dagger] + a^\dagger N)|n\rangle = (n + 1)a^\dagger|n\rangle \tag{2.3.12a}$$

and

$$N a|n\rangle = ([N, a] + aN)|n\rangle = (n - 1)a|n\rangle. \tag{2.3.12b}$$

These relations imply that $a^\dagger|n\rangle$ ($a|n\rangle$) is also an eigenket of $N$ with eigenvalue
increased (decreased) by one. Because the increase (decrease) of $n$ by one amounts
to the creation (annihilation) of one quantum unit of energy $\hbar\omega$, the term creation
operator (annihilation operator) for $a^\dagger$ ($a$) is deemed appropriate.

Equation (2.3.12b) implies that $a|n\rangle$ and $|n - 1\rangle$ are the same up to a
multiplicative constant. We write

$$a|n\rangle = c|n - 1\rangle, \tag{2.3.13}$$

where $c$ is a numerical constant to be determined from the requirement that both
$|n\rangle$ and $|n - 1\rangle$ be normalized. First, note that

$$\langle n|a^\dagger a|n\rangle = |c|^2. \tag{2.3.14}$$







We can evaluate the left-hand side of (2.3.14) by noting that $a^\dagger a$ is just the number
operator, so

$$n = |c|^2. \tag{2.3.15}$$

Taking $c$ to be real and positive by convention, we finally obtain

$$a|n\rangle = \sqrt{n}\,|n - 1\rangle. \tag{2.3.16}$$

Similarly, it is easy to show that

$$a^\dagger|n\rangle = \sqrt{n + 1}\,|n + 1\rangle. \tag{2.3.17}$$

Suppose that we keep on applying the annihilation operator $a$ to both sides of
(2.3.16):

$$a^2|n\rangle = \sqrt{n(n - 1)}\,|n - 2\rangle,$$
$$a^3|n\rangle = \sqrt{n(n - 1)(n - 2)}\,|n - 3\rangle, \tag{2.3.18}$$
$$\vdots$$

We can obtain number-operator eigenkets with smaller and smaller $n$ until the
sequence terminates, which is bound to happen whenever we start with a positive
integer $n$. One may argue that if we start with a noninteger $n$, the sequence will
not terminate, leading to eigenkets with a negative value of $n$. But we also have
the positivity requirement for the norm of $a|n\rangle$:

$$n = \langle n|N|n\rangle = (\langle n|a^\dagger)\cdot(a|n\rangle) \ge 0, \tag{2.3.19}$$

which implies that $n$ can never be negative! So we conclude that the sequence must
terminate with $n = 0$ and that the allowed values of $n$ are nonnegative integers.

Because the smallest possible value of $n$ is zero, the ground state of the
harmonic oscillator has

$$E_0 = \tfrac{1}{2}\hbar\omega. \tag{2.3.20}$$

We can now successively apply the creation operator $a^\dagger$ to the ground state $|0\rangle$.
Using (2.3.17), we obtain

$$|1\rangle = a^\dagger|0\rangle,$$
$$|2\rangle = \left(\frac{a^\dagger}{\sqrt{2}}\right)|1\rangle = \left[\frac{(a^\dagger)^2}{\sqrt{2}}\right]|0\rangle,$$
$$|3\rangle = \left(\frac{a^\dagger}{\sqrt{3}}\right)|2\rangle = \left[\frac{(a^\dagger)^3}{\sqrt{3!}}\right]|0\rangle, \tag{2.3.21}$$
$$\vdots$$
$$|n\rangle = \left[\frac{(a^\dagger)^n}{\sqrt{n!}}\right]|0\rangle.$$







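In this way we have succeeded in constructing simultaneous eigenkets of $N$
and $H$ with energy eigenvalues

$$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega \quad (n = 0, 1, 2, 3, \ldots). \tag{2.3.22}$$

From (2.3.16), (2.3.17), and the orthonormality requirement for $\{|n\rangle\}$, we
obtain the matrix elements

$$\langle n'|a|n\rangle = \sqrt{n}\,\delta_{n',n-1}, \qquad \langle n'|a^\dagger|n\rangle = \sqrt{n + 1}\,\delta_{n',n+1}. \tag{2.3.23}$$

Using these together with

$$x = \sqrt{\frac{\hbar}{2m\omega}}\,(a + a^\dagger), \qquad p = i\sqrt{\frac{m\hbar\omega}{2}}\,(-a + a^\dagger), \tag{2.3.24}$$

we derive the matrix elements of the $x$ and $p$ operators:

$$\langle n'|x|n\rangle = \sqrt{\frac{\hbar}{2m\omega}}\left(\sqrt{n}\,\delta_{n',n-1} + \sqrt{n + 1}\,\delta_{n',n+1}\right), \tag{2.3.25a}$$

$$\langle n'|p|n\rangle = i\sqrt{\frac{m\hbar\omega}{2}}\left(-\sqrt{n}\,\delta_{n',n-1} + \sqrt{n + 1}\,\delta_{n',n+1}\right). \tag{2.3.25b}$$

Notice that neither $x$ nor $p$ is diagonal in the $N$-representation we are using. This
is not surprising because $x$ and $p$, like $a$ and $a^\dagger$, do not commute with $N$.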
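The matrix elements (2.3.23) and (2.3.25) can be written down directly as finite matrices if we truncate the number basis at some maximum $n$. The sketch below is only an illustration under that truncation assumption (and with $\hbar = m = \omega = 1$, our choice); the function name is ours. Note that the truncation spoils $[a, a^\dagger] = 1$ in the last diagonal entry, which the code makes visible.

```python
import numpy as np


def ladder_operators(dim):
    """Truncated matrices of a and a^dagger in the basis |0>, ..., |dim-1>."""
    # <n-1|a|n> = sqrt(n): entries on the first superdiagonal, see (2.3.23).
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)
    return a, a.T.copy()


dim = 8
hbar = m = omega = 1.0  # assumption: natural units
a, adag = ladder_operators(dim)

N = adag @ a                                           # number operator (2.3.4)
x = np.sqrt(hbar / (2 * m * omega)) * (a + adag)       # (2.3.24)
p = 1j * np.sqrt(m * hbar * omega / 2) * (adag - a)    # (2.3.24)
comm = a @ adag - adag @ a                             # should be 1, see (2.3.3)

print(np.diag(N))      # 0, 1, 2, ..., dim-1
print(np.diag(comm))   # all ones except the last entry (truncation artifact)
print(x[2, 3], p[3, 2])
```

Both $x$ and $p$ come out purely off-diagonal, in line with the remark that neither is diagonal in the $N$-representation.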

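The operator method can also be used to obtain the energy eigenfunctions in
position space. Let us start with the ground state defined by

$$a|0\rangle = 0, \tag{2.3.26}$$

which, in the $x$-representation, reads

$$\langle x'|a|0\rangle = \sqrt{\frac{m\omega}{2\hbar}}\,\langle x'|\left(x + \frac{ip}{m\omega}\right)|0\rangle = 0. \tag{2.3.27}$$

Recalling (1.7.17), we can regard this as a differential equation for the
ground-state wave function $\langle x'|0\rangle$:

$$\left(x' + x_0^2\frac{d}{dx'}\right)\langle x'|0\rangle = 0, \tag{2.3.28}$$

where we have introduced

$$x_0 \equiv \sqrt{\frac{\hbar}{m\omega}}, \tag{2.3.29}$$

which sets the length scale of the oscillator. We see that the normalized solution
to (2.3.28) is

$$\langle x'|0\rangle = \left(\frac{1}{\pi^{1/4}\sqrt{x_0}}\right)\exp\left[-\frac{1}{2}\left(\frac{x'}{x_0}\right)^2\right]. \tag{2.3.30}$$

We can also obtain the energy eigenfunctions for excited states by evaluating

$$\langle x'|1\rangle = \langle x'|a^\dagger|0\rangle = \left(\frac{1}{\sqrt{2}\,x_0}\right)\left(x' - x_0^2\frac{d}{dx'}\right)\langle x'|0\rangle,$$

$$\langle x'|2\rangle = \left(\frac{1}{\sqrt{2}}\right)\langle x'|(a^\dagger)^2|0\rangle = \left(\frac{1}{\sqrt{2!}}\right)\left(\frac{1}{\sqrt{2}\,x_0}\right)^2\left(x' - x_0^2\frac{d}{dx'}\right)^2\langle x'|0\rangle, \ldots \tag{2.3.31}$$

In general, we obtain

$$\langle x'|n\rangle = \left(\frac{1}{\pi^{1/4}\sqrt{2^n n!}}\right)\left(\frac{1}{x_0^{\,n+1/2}}\right)\left(x' - x_0^2\frac{d}{dx'}\right)^n \exp\left[-\frac{1}{2}\left(\frac{x'}{x_0}\right)^2\right]. \tag{2.3.32}$$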
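For numerical work it is convenient to trade the repeated differentiation in (2.3.32) for Hermite polynomials. The sketch below assumes the standard identity $(\xi - d/d\xi)^n e^{-\xi^2/2} = H_n(\xi)\,e^{-\xi^2/2}$ with $\xi = x'/x_0$ and $H_n$ the physicists' Hermite polynomial; this identity is not established in this section and is used here as an assumption. The function names and the integration grid are illustrative choices.

```python
import math

import numpy as np
from numpy.polynomial.hermite import hermval  # physicists' Hermite polynomials


def sho_wavefunction(n, x, x0=1.0):
    """<x'|n> of (2.3.32), rewritten with H_n (assumed identity, see lead-in)."""
    xi = np.asarray(x, dtype=float) / x0
    coeffs = [0.0] * n + [1.0]  # picks out H_n in the Hermite series
    norm = 1.0 / math.sqrt(2 ** n * math.factorial(n) * math.sqrt(math.pi) * x0)
    return norm * hermval(xi, coeffs) * np.exp(-0.5 * xi ** 2)


def overlap(n1, n2, x_max=12.0, n_points=4001):
    """Riemann-sum estimate of the integral of <n1|x'><x'|n2> over x'."""
    x = np.linspace(-x_max, x_max, n_points)
    dx = x[1] - x[0]
    return float(np.sum(sho_wavefunction(n1, x) * sho_wavefunction(n2, x)) * dx)


print(overlap(0, 0), overlap(3, 3), overlap(1, 3))
```

The diagonal overlaps should come out close to 1 and the off-diagonal ones close to 0, as required by the orthonormality of $\{|n\rangle\}$.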



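It is instructive to look at the expectation values of $x^2$ and $p^2$ for the ground
state. First, note that

$$x^2 = \left(\frac{\hbar}{2m\omega}\right)\left(a^2 + a^{\dagger 2} + a^\dagger a + a a^\dagger\right). \tag{2.3.33}$$

When we take the expectation value of $x^2$, only the last term in (2.3.33) yields a
nonvanishing contribution:

$$\langle x^2\rangle = \frac{\hbar}{2m\omega} = \frac{x_0^2}{2}. \tag{2.3.34}$$

Likewise,

$$\langle p^2\rangle = \frac{\hbar m\omega}{2}. \tag{2.3.35}$$

It follows that the expectation values of the kinetic and the potential energies are,
respectively,

$$\left\langle\frac{p^2}{2m}\right\rangle = \frac{\hbar\omega}{4} = \frac{\langle H\rangle}{2} \quad \text{and} \quad \left\langle\frac{m\omega^2 x^2}{2}\right\rangle = \frac{\hbar\omega}{4} = \frac{\langle H\rangle}{2}, \tag{2.3.36}$$

as expected from the virial theorem. From (2.3.25a) and (2.3.25b), it follows that

$$\langle x\rangle = \langle p\rangle = 0, \tag{2.3.37}$$

which also holds for the excited states. We therefore have

$$\langle(\Delta x)^2\rangle = \langle x^2\rangle = \frac{\hbar}{2m\omega} \quad \text{and} \quad \langle(\Delta p)^2\rangle = \langle p^2\rangle = \frac{\hbar m\omega}{2}, \tag{2.3.38}$$

and we see that the uncertainty relation is satisfied in the minimum uncertainty
product form:

$$\langle(\Delta x)^2\rangle\langle(\Delta p)^2\rangle = \frac{\hbar^2}{4}. \tag{2.3.39}$$

This is not surprising because the ground-state wave function has a Gaussian
shape. In contrast, the uncertainty products for the excited states are larger:

$$\langle(\Delta x)^2\rangle\langle(\Delta p)^2\rangle = \left(n + \tfrac{1}{2}\right)^2\hbar^2, \tag{2.3.40}$$

as the reader may easily verify.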
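One way to carry out the verification of (2.3.40) is with the matrix elements (2.3.25) in a truncated number basis. The sketch below is an illustration under our own assumptions ($\hbar = m = \omega = 1$ by default, a basis just large enough that the truncation does not touch the element we need, and a hypothetical function name).

```python
import numpy as np


def uncertainty_product(n, hbar=1.0, m=1.0, omega=1.0):
    """<(dx)^2><(dp)^2> in the n-th energy eigenstate from truncated matrices."""
    dim = n + 3  # |n+1> must lie inside the truncated space for <n|x^2|n>
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)
    adag = a.T
    x = np.sqrt(hbar / (2 * m * omega)) * (a + adag)
    p = 1j * np.sqrt(m * hbar * omega / 2) * (adag - a)
    # <x> = <p> = 0 by (2.3.37), so the dispersions are just <x^2> and <p^2>.
    x2 = (x @ x)[n, n].real
    p2 = (p @ p)[n, n].real
    return x2 * p2


for n in range(5):
    print(n, uncertainty_product(n), (n + 0.5) ** 2)
```

The two printed columns should coincide, with $n = 0$ reproducing the minimum value $\hbar^2/4$ of (2.3.39).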



Time Development of the Oscillator 

So far we have not discussed the time evolution of oscillator state kets or of
observables such as $x$ and $p$. Everything we have done is supposed to hold at some
instant of time, say at $t = 0$; the operators $x$, $p$, $a$, and $a^\dagger$ are to be regarded either
as Schrodinger-picture operators (at all $t$) or as Heisenberg-picture operators at
$t = 0$. In the remaining part of this section, we work exclusively in the Heisenberg
picture, which means that $x$, $p$, $a$, and $a^\dagger$ are all time-dependent even though
we do not explicitly write $x^{(H)}(t)$, and so forth.

The Heisenberg equations of motion for $p$ and $x$ are, from (2.2.32) and (2.2.33),

$$\frac{dp}{dt} = -m\omega^2 x \tag{2.3.41a}$$

and

$$\frac{dx}{dt} = \frac{p}{m}. \tag{2.3.41b}$$

This pair of coupled differential equations is equivalent to two uncoupled
differential equations for $a$ and $a^\dagger$, namely,

$$\frac{da}{dt} = \sqrt{\frac{m\omega}{2\hbar}}\left(\frac{p}{m} - i\omega x\right) = -i\omega a \tag{2.3.42a}$$

and

$$\frac{da^\dagger}{dt} = i\omega a^\dagger, \tag{2.3.42b}$$

whose solutions are

$$a(t) = a(0)\exp(-i\omega t) \quad \text{and} \quad a^\dagger(t) = a^\dagger(0)\exp(i\omega t). \tag{2.3.43}$$

Incidentally, these relations explicitly show that $N$ and $H$ are time-independent
operators even in the Heisenberg picture, as they must be. In terms of $x$ and $p$, we
can rewrite (2.3.43) as

$$x(t) + \frac{ip(t)}{m\omega} = x(0)\exp(-i\omega t) + i\left[\frac{p(0)}{m\omega}\right]\exp(-i\omega t),$$

$$x(t) - \frac{ip(t)}{m\omega} = x(0)\exp(i\omega t) - i\left[\frac{p(0)}{m\omega}\right]\exp(i\omega t). \tag{2.3.44}$$

Equating the Hermitian and anti-Hermitian parts of both sides separately, we
deduce

$$x(t) = x(0)\cos\omega t + \left[\frac{p(0)}{m\omega}\right]\sin\omega t \tag{2.3.45a}$$

and

$$p(t) = -m\omega x(0)\sin\omega t + p(0)\cos\omega t. \tag{2.3.45b}$$

These look the same as the classical equations of motion. We see that the $x$ and $p$
operators "oscillate" just like their classical analogues.

For pedagogical reasons, we now present an alternative derivation of (2.3.45a).
Instead of solving the Heisenberg equation of motion, we attempt to evaluate

$$x(t) = \exp\left(\frac{iHt}{\hbar}\right)x(0)\exp\left(\frac{-iHt}{\hbar}\right). \tag{2.3.46}$$

To this end we record a very useful formula:

$$\exp(iG\lambda)\,A\,\exp(-iG\lambda) = A + i\lambda[G, A] + \left(\frac{i^2\lambda^2}{2!}\right)[G, [G, A]] + \cdots + \left(\frac{i^n\lambda^n}{n!}\right)[G, [G, [G, \ldots [G, A]]] \ldots] + \cdots, \tag{2.3.47}$$

where $G$ is a Hermitian operator and $\lambda$ is a real parameter. We leave the proof
of this formula, which is known as the Baker-Hausdorff lemma, as an exercise.
Applying this formula to (2.3.46), we obtain

$$\exp\left(\frac{iHt}{\hbar}\right)x(0)\exp\left(\frac{-iHt}{\hbar}\right) = x(0) + \left(\frac{it}{\hbar}\right)[H, x(0)] + \left(\frac{i^2t^2}{2!\,\hbar^2}\right)[H, [H, x(0)]] + \cdots. \tag{2.3.48}$$

Each term on the right-hand side can be reduced to either $x$ or $p$ by repeatedly
using

$$[H, x(0)] = \frac{-i\hbar\,p(0)}{m} \tag{2.3.49a}$$

and

$$[H, p(0)] = i\hbar m\omega^2 x(0). \tag{2.3.49b}$$

Thus

$$\exp\left(\frac{iHt}{\hbar}\right)x(0)\exp\left(\frac{-iHt}{\hbar}\right) = x(0) + \left[\frac{p(0)}{m}\right]t - \left(\frac{1}{2!}\right)t^2\omega^2 x(0) - \left(\frac{1}{3!}\right)\frac{t^3\omega^2 p(0)}{m} + \cdots$$

$$= x(0)\cos\omega t + \left[\frac{p(0)}{m\omega}\right]\sin\omega t, \tag{2.3.50}$$

in agreement with (2.3.45a).
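The same agreement can be seen numerically. The sketch below represents $x(0)$ and $p(0)$ by truncated matrices in the number basis and takes $H = \hbar\omega(N + \tfrac{1}{2})$ as a diagonal matrix, so that $\exp(\pm iHt/\hbar)$ is trivial to build. The truncation size, the units $\hbar = m = \omega = 1$, and the function names are our own assumptions.

```python
import numpy as np

hbar = m = omega = 1.0  # assumption: natural units
dim = 10                # assumption: truncation of the number basis

a = np.diag(np.sqrt(np.arange(1, dim)), k=1).astype(complex)
adag = a.conj().T
x_0 = np.sqrt(hbar / (2 * m * omega)) * (a + adag)     # x(0), from (2.3.24)
p_0 = 1j * np.sqrt(m * hbar * omega / 2) * (adag - a)  # p(0), from (2.3.24)
energies = hbar * omega * (np.arange(dim) + 0.5)       # H is diagonal, (2.3.6)


def x_heisenberg(t):
    """x(t) = exp(iHt/hbar) x(0) exp(-iHt/hbar), as in (2.3.46)."""
    U = np.diag(np.exp(-1j * energies * t / hbar))
    return U.conj().T @ x_0 @ U


def x_closed_form(t):
    """Right-hand side of (2.3.45a)."""
    return x_0 * np.cos(omega * t) + p_0 / (m * omega) * np.sin(omega * t)


t = 0.9
print(np.max(np.abs(x_heisenberg(t) - x_closed_form(t))))  # rounding-level
```

Because $H$ is taken diagonal in the truncated basis, the two expressions agree to rounding error for any truncation size in this particular check.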

From (2.3.45a) and (2.3.45b), one may be tempted to conclude that $\langle x\rangle$ and $\langle p\rangle$
always oscillate with angular frequency $\omega$. However, this inference is not correct.
Take any energy eigenstate characterized by a definite value of $n$; the expectation
value $\langle n|x(t)|n\rangle$ vanishes because the operators $x(0)$ and $p(0)$ change $n$ by $\pm 1$,
and $|n\rangle$ and $|n \pm 1\rangle$ are orthogonal. This point is also obvious from our earlier
conclusion (see Section 2.1) that the expectation value of an observable taken
with respect to a stationary state does not vary with time. To observe oscillations
reminiscent of the classical oscillator, we must look at a superposition of energy
eigenstates such as

$$|\alpha\rangle = c_0|0\rangle + c_1|1\rangle. \tag{2.3.51}$$

The expectation value of $x(t)$ taken with respect to (2.3.51) does oscillate, as the
reader may readily verify.
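A sketch of that verification, using only (2.3.16), (2.3.24), and (2.3.43), runs as follows. Writing $x(t)$ in terms of $a(0)$ and $a^\dagger(0)$ and noting that $\langle\alpha|a(0)|\alpha\rangle = c_0^* c_1$ for the state (2.3.51), we find

```latex
\begin{aligned}
x(t) &= \sqrt{\frac{\hbar}{2m\omega}}
        \left[a(0)\,e^{-i\omega t} + a^\dagger(0)\,e^{i\omega t}\right], \\
\langle\alpha|x(t)|\alpha\rangle
     &= \sqrt{\frac{\hbar}{2m\omega}}
        \left(c_0^{*}c_1\,e^{-i\omega t} + c_0\,c_1^{*}\,e^{i\omega t}\right)
      = \sqrt{\frac{2\hbar}{m\omega}}\;
        \mathrm{Re}\!\left(c_0^{*}c_1\,e^{-i\omega t}\right).
\end{aligned}
```

The expectation value therefore oscillates at angular frequency $\omega$ whenever both $c_0$ and $c_1$ are nonzero, and it vanishes identically if either coefficient is zero, in line with the statement about energy eigenstates.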

We have seen that an energy eigenstate does not behave like the classical
oscillator (in the sense of oscillating expectation values for $x$ and $p$) no matter
how large $n$ may be. We may logically ask, How can we construct a superposition
of energy eigenstates that most closely imitates the classical oscillator? In
wave-function language, we want a wave packet that bounces back and forth without
spreading in shape. It turns out that a coherent state defined by the eigenvalue
equation for the non-Hermitian annihilation operator $a$,

$$a|\lambda\rangle = \lambda|\lambda\rangle, \tag{2.3.52}$$

with, in general, a complex eigenvalue $\lambda$ does the desired job. The coherent state
has many other remarkable properties:

1. When it is expressed as a superposition of energy (or $N$) eigenstates,

$$|\lambda\rangle = \sum_{n=0}^{\infty} f(n)|n\rangle, \tag{2.3.53}$$

the distribution of $|f(n)|^2$ with respect to $n$ is of the Poisson type about
some mean value $\bar{n}$:

$$|f(n)|^2 = \left(\frac{\bar{n}^n}{n!}\right)\exp(-\bar{n}). \tag{2.3.54}$$

2. It can be obtained by translating the oscillator ground state by some finite
distance.

3. It satisfies the minimum uncertainty product relation at all times.

A systematic study of coherent states, pioneered by R. Glauber, is very rewarding;
the reader is urged to work out Problem 2.19 on this subject at the end of this
chapter.*
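As a preview of that problem, the sketch below builds a coherent state numerically. It assumes the standard closed form $f(n) = e^{-|\lambda|^2/2}\lambda^n/\sqrt{n!}$, which is not derived in this section and is taken here as an assumption; the truncation size, the value of $\lambda$, and the function name are likewise illustrative.

```python
import math

import numpy as np


def coherent_state(lam, dim):
    """Coefficients f(n), n = 0..dim-1, of an (assumed) coherent state."""
    n = np.arange(dim)
    log_sqrt_fact = np.array([0.5 * math.lgamma(k + 1) for k in n])
    return np.exp(-abs(lam) ** 2 / 2) * lam ** n / np.exp(log_sqrt_fact)


dim = 40              # assumption: truncation of the number basis
lam = 1.2 + 0.5j      # hypothetical complex eigenvalue
ket = coherent_state(lam, dim)
a = np.diag(np.sqrt(np.arange(1, dim)), k=1)   # annihilation operator, (2.3.23)

n = np.arange(dim)
probs = np.abs(ket) ** 2
mean_n = float(np.sum(n * probs))
poisson = np.array([mean_n ** k * math.exp(-mean_n) / math.factorial(k)
                    for k in range(dim)])

print(np.max(np.abs(a @ ket - lam * ket)))   # eigenvalue equation (2.3.52)
print(mean_n, abs(lam) ** 2)                 # mean of the distribution
print(np.max(np.abs(probs - poisson)))       # Poisson form (2.3.54)
```

With this form of $f(n)$ the mean value $\bar{n}$ in (2.3.54) comes out as $|\lambda|^2$.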



2.4 ■ SCHRODINGER'S WAVE EQUATION 
The Time-Dependent Wave Equation 

We now turn to the Schrodinger picture and examine the time evolution of $|\alpha, t_0; t\rangle$
in the $x$-representation. In other words, our task is to study the behavior of the
wave function

$$\psi(\mathbf{x}', t) = \langle\mathbf{x}'|\alpha, t_0; t\rangle \tag{2.4.1}$$

as a function of time, where $|\alpha, t_0; t\rangle$ is a state ket in the Schrodinger picture at
time $t$, and $\langle\mathbf{x}'|$ is a time-independent position eigenbra with eigenvalue $\mathbf{x}'$. The
Hamiltonian operator is taken to be

$$H = \frac{\mathbf{p}^2}{2m} + V(\mathbf{x}). \tag{2.4.2}$$

The potential $V(\mathbf{x})$ is a Hermitian operator; it is also local in the sense that in the
$x$-representation we have

$$\langle\mathbf{x}''|V(\mathbf{x})|\mathbf{x}'\rangle = V(\mathbf{x}')\,\delta^3(\mathbf{x}' - \mathbf{x}''), \tag{2.4.3}$$

where $V(\mathbf{x}')$ is a real function of $\mathbf{x}'$. Later in this book we will consider
more-complicated Hamiltonians: a time-dependent potential $V(\mathbf{x}, t)$; a nonlocal but
separable potential where the right-hand side of (2.4.3) is replaced by $v_1(\mathbf{x}'')v_2(\mathbf{x}')$;
a momentum-dependent interaction of the form $\mathbf{p}\cdot\mathbf{A} + \mathbf{A}\cdot\mathbf{p}$, where $\mathbf{A}$ is the vector
potential in electrodynamics, and so on.

We now derive Schrodinger's time-dependent wave equation. We first write the
Schrodinger equation for a state ket (2.1.27) in the $x$-representation:

$$i\hbar\frac{\partial}{\partial t}\langle\mathbf{x}'|\alpha, t_0; t\rangle = \langle\mathbf{x}'|H|\alpha, t_0; t\rangle, \tag{2.4.4}$$

where we have used the fact that the position eigenbras in the Schrodinger picture
do not change with time. Using (1.7.20), we can write the kinetic-energy
contribution to the right-hand side of (2.4.4) as

$$\left\langle\mathbf{x}'\left|\frac{\mathbf{p}^2}{2m}\right|\alpha, t_0; t\right\rangle = -\left(\frac{\hbar^2}{2m}\right)\nabla'^2\langle\mathbf{x}'|\alpha, t_0; t\rangle. \tag{2.4.5}$$




*For applications to laser physics, see Sargent, Scully, and Lamb (1974) and Loudon (2000). See 
also the discussion on squeezed light at the end of Section 7.6 of this book. 







As for $V(\mathbf{x})$, we simply use

$$\langle\mathbf{x}'|V(\mathbf{x}) = \langle\mathbf{x}'|V(\mathbf{x}'), \tag{2.4.6}$$

where $V(\mathbf{x}')$ is no longer an operator. Combining everything, we deduce

$$i\hbar\frac{\partial}{\partial t}\langle\mathbf{x}'|\alpha, t_0; t\rangle = -\left(\frac{\hbar^2}{2m}\right)\nabla'^2\langle\mathbf{x}'|\alpha, t_0; t\rangle + V(\mathbf{x}')\langle\mathbf{x}'|\alpha, t_0; t\rangle, \tag{2.4.7}$$

which we recognize to be the celebrated time-dependent wave equation of E.
Schrodinger, usually written as

$$i\hbar\frac{\partial}{\partial t}\psi(\mathbf{x}', t) = -\left(\frac{\hbar^2}{2m}\right)\nabla'^2\psi(\mathbf{x}', t) + V(\mathbf{x}')\psi(\mathbf{x}', t). \tag{2.4.8}$$

The quantum mechanics based on wave equation (2.4.8) is known as wave
mechanics. This equation is, in fact, the starting point of many textbooks on quantum
mechanics. In our formalism, however, this is just the Schrodinger equation for a
state ket written explicitly in the $x$-basis when the Hamiltonian operator is taken
to be (2.4.2).



The Time-Independent Wave Equation

We now derive the partial differential equation satisfied by energy eigenfunctions.
We showed in Section 2.1 that the time dependence of a stationary state is given
by $\exp(-iE_{a'}t/\hbar)$. This enables us to write its wave function as

$$\langle\mathbf{x}'|a', t_0; t\rangle = \langle\mathbf{x}'|a'\rangle\exp\left(\frac{-iE_{a'}t}{\hbar}\right), \tag{2.4.9}$$

where it is understood that initially the system is prepared in a simultaneous
eigenstate of $A$ and $H$ with eigenvalues $a'$ and $E_{a'}$, respectively. Let us now substitute
(2.4.9) into the time-dependent Schrodinger equation (2.4.7). We are then led to

$$-\left(\frac{\hbar^2}{2m}\right)\nabla'^2\langle\mathbf{x}'|a'\rangle + V(\mathbf{x}')\langle\mathbf{x}'|a'\rangle = E_{a'}\langle\mathbf{x}'|a'\rangle. \tag{2.4.10}$$

This partial differential equation is satisfied by the energy eigenfunction $\langle\mathbf{x}'|a'\rangle$
with energy eigenvalue $E_{a'}$. Actually, in wave mechanics where the Hamiltonian
operator is given as a function of $\mathbf{x}$ and $\mathbf{p}$, as in (2.4.2), it is not necessary to refer
explicitly to observable $A$ that commutes with $H$, because we can always choose $A$
to be that function of the observables $\mathbf{x}$ and $\mathbf{p}$ that coincides with $H$ itself. We may
therefore omit reference to $a'$ and simply write (2.4.10) as the partial differential
equation to be satisfied by the energy eigenfunction $u_E(\mathbf{x}')$:

$$-\left(\frac{\hbar^2}{2m}\right)\nabla'^2 u_E(\mathbf{x}') + V(\mathbf{x}')u_E(\mathbf{x}') = E\,u_E(\mathbf{x}'). \tag{2.4.11}$$








This is the time-independent wave equation of E. Schrodinger — announced in 
the first of four monumental papers, all written in the first half of 1926 — that laid 
the foundations of wave mechanics. In the same paper he immediately applied 
(2.4.1 1) to deri ve the energy Spectrum of the hydrogen atom. 

To solve (2.4.11) some boundary condition has to be imposed. Suppose we 
seek a solution to (2.4.1 1) with 

E< lim V(x'), (2.4.12) 

|x '|->-00 



where the inequality relation is to holdfor |x'| -* oo in any direction. The appro- 
priate boundary condition to be used in this case is 

ue( x ') -> 0 as |x'| -> oo. (2.4.13) 

Physically this means that the particle is bound or confined within a finite region
of space. We know from the theory of partial differential equations that (2.4.11)
subject to boundary condition (2.4.13) allows nontrivial solutions only for a
discrete set of values of $E$. It is in this sense that the time-independent Schrödinger
equation (2.4.11) yields the quantization of energy levels.* Once the partial
differential equation (2.4.11) is written, the problem of finding the energy levels of
microscopic physical systems is as straightforward as that of finding the
characteristic frequencies of vibrating strings or membranes. In both cases we solve
boundary-value problems in mathematical physics.
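
The way a boundary condition at infinity singles out discrete energies can be seen in a small numerical experiment. The sketch below is an illustration added here, not part of the original argument. It assumes one dimension, units in which ħ = m = 1, and the sample potential V(x) = x²/2; the function names `u_at_xmax` and `bound_state_energy` are hypothetical. It integrates the wave equation outward from x = 0 and bisects on E until the solution no longer blows up at large x.

```python
def u_at_xmax(E, parity=+1, x_max=6.0, n_steps=3000):
    """Integrate u'' = 2 (V(x) - E) u from x = 0 to x_max and return u(x_max).

    Assumptions (illustration only): hbar = m = 1 and V(x) = x**2 / 2.
    parity=+1 starts from u(0)=1, u'(0)=0; parity=-1 from u(0)=0, u'(0)=1.
    """
    def accel(x, u):
        return 2.0 * (0.5 * x * x - E) * u

    h = x_max / n_steps
    x = 0.0
    u, v = (1.0, 0.0) if parity > 0 else (0.0, 1.0)
    for _ in range(n_steps):
        # Classical fourth-order Runge-Kutta step for u' = v, v' = accel(x, u).
        k1u, k1v = v, accel(x, u)
        k2u, k2v = v + 0.5 * h * k1v, accel(x + 0.5 * h, u + 0.5 * h * k1u)
        k3u, k3v = v + 0.5 * h * k2v, accel(x + 0.5 * h, u + 0.5 * h * k2u)
        k4u, k4v = v + h * k3v, accel(x + h, u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        x += h
    return u


def bound_state_energy(E_lo, E_hi, parity=+1, tol=1e-9):
    """Bisect on E, assuming u(x_max) changes sign once inside [E_lo, E_hi]."""
    f_lo = u_at_xmax(E_lo, parity)
    while E_hi - E_lo > tol:
        E_mid = 0.5 * (E_lo + E_hi)
        f_mid = u_at_xmax(E_mid, parity)
        if (f_mid > 0) == (f_lo > 0):
            E_lo, f_lo = E_mid, f_mid   # same sign: the root lies above E_mid
        else:
            E_hi = E_mid                # sign change: the root lies below E_mid
    return 0.5 * (E_lo + E_hi)
```

For a generic trial E the integrated solution diverges at large x with one sign or the other; only at isolated energies does it decay. With the assumptions above, `bound_state_energy(0.1, 1.0, parity=+1)` should settle close to 0.5 and `bound_state_energy(1.0, 2.0, parity=-1)` close to 1.5, the two lowest levels of this sample potential.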

A short digression on the history of quantum mechanics is in order here. The
fact that exactly soluble eigenvalue problems in the theory of partial differential
equations can also be treated using matrix methods was already known to
mathematicians in the first quarter of the twentieth century. Furthermore, theoretical
physicists like M. Born frequently consulted great mathematicians of the day,
D. Hilbert and H. Weyl in particular. Yet when matrix mechanics was born in
the summer of 1925, it did not immediately occur to the theoretical physicists
or the mathematicians to reformulate it using the language of partial differential
equations. Six months after Heisenberg’s pioneering paper, wave mechanics was
proposed by Schrödinger. However, a close inspection of his papers shows that
he was not at all influenced by the earlier works of Heisenberg, Born, and Jordan.
Instead, the train of reasoning that led Schrödinger to formulate wave mechanics
has its roots in W. R. Hamilton’s analogy between optics and mechanics, on which
we will comment later, and in the particle-wave hypothesis of L. de Broglie. Once
wave mechanics was formulated, many people, including Schrödinger himself,
showed the equivalence between wave mechanics and matrix mechanics.

It is assumed that the reader of this book has some experience in solving the
time-dependent and time-independent wave equations. He or she should be
familiar with the time evolution of a Gaussian wave packet in a force-free region;
should be able to solve one-dimensional transmission-reflection problems
involving a rectangular potential barrier, and the like; should have seen derived some
simple solutions of the time-independent wave equation (a particle in a box, a
particle in a square well, the simple harmonic oscillator, the hydrogen atom, and
so on); and should also be familiar with some general properties of the energy
eigenfunctions and energy eigenvalues, such as (1) the fact that the energy levels
exhibit a discrete or continuous spectrum depending on whether or not (2.4.12)
is satisfied and (2) the property that the energy eigenfunction in one dimension is
sinusoidal or damped depending on whether $E - V(x')$ is positive or negative.

*Schrödinger’s paper that announced (2.4.11) is appropriately entitled Quantisierung als
Eigenwertproblem (Quantization as an Eigenvalue Problem).

In this book, we do not thoroughly cover these more elementary topics and
solutions. Some of these (for example, the harmonic oscillator and hydrogen atom)
are pursued, but at a mathematical level somewhat higher than what is usually
seen in undergraduate courses. In any case, a brief summary of elementary
solutions to Schrödinger’s equations is presented in Appendix B.

Interpretations of the Wave Function 

We now turn to discussions of the physical interpretations of the wave function.
In Section 1.7 we commented on the probabilistic interpretation of $|\psi|^2$ that
follows from the fact that $\langle \mathbf{x}'|\alpha,t_0;t\rangle$ is to be regarded as an expansion coefficient of
$|\alpha,t_0;t\rangle$ in terms of the position eigenkets $\{|\mathbf{x}'\rangle\}$. The quantity $\rho(\mathbf{x}',t)$ defined by

$$\rho(\mathbf{x}',t) = |\psi(\mathbf{x}',t)|^2 = |\langle \mathbf{x}'|\alpha,t_0;t\rangle|^2 \tag{2.4.14}$$

is therefore regarded as the probability density in wave mechanics. Specifically,
when we use a detector that ascertains the presence of the particle within a small
volume element $d^3x'$ around $\mathbf{x}'$, the probability of recording a positive result at
time $t$ is given by $\rho(\mathbf{x}',t)\,d^3x'$.

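In the remainder of this section we use $\mathbf{x}$ for $\mathbf{x}'$ because the position operator
will not appear. Using Schrödinger’s time-dependent wave equation, it is
straightforward to derive the continuity equation

$$\frac{\partial \rho}{\partial t} + \nabla\cdot\mathbf{j} = 0, \tag{2.4.15}$$

where $\rho(\mathbf{x},t)$ stands for $|\psi|^2$ as before, and $\mathbf{j}(\mathbf{x},t)$, known as the probability flux,
is given by

$$\mathbf{j}(\mathbf{x},t) = -\left(\frac{i\hbar}{2m}\right)\left[\psi^*\nabla\psi - (\nabla\psi^*)\psi\right] = \left(\frac{\hbar}{m}\right)\mathrm{Im}(\psi^*\nabla\psi). \tag{2.4.16}$$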



The reality of the potential V (or the Hermiticity of the V operator) has played 
a crucial role in our obtaining this result. Conversely, a complex potential can 
phenomenologically account for the disappearance of a particle; such a potential 
is often used for nuclear reactions where incident particles get absorbed by nuclei. 
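
For readers who want the intermediate steps, the following sketch spells out how (2.4.15) and (2.4.16) follow from the time-dependent wave equation when V is real, writing ρ = |ψ|². The last line, which shows what changes for a complex potential V = V_R + iV_I, uses notation (V_R, V_I) introduced here only for illustration.

```latex
\begin{align*}
i\hbar\,\frac{\partial\psi}{\partial t} &= -\frac{\hbar^2}{2m}\nabla^2\psi + V\psi,
\qquad
-i\hbar\,\frac{\partial\psi^*}{\partial t} = -\frac{\hbar^2}{2m}\nabla^2\psi^* + V\psi^*
\quad (V \text{ real}),\\
i\hbar\,\frac{\partial}{\partial t}\left(\psi^*\psi\right)
 &= -\frac{\hbar^2}{2m}\left(\psi^*\nabla^2\psi - \psi\nabla^2\psi^*\right)
  = -\frac{\hbar^2}{2m}\,\nabla\cdot\left[\psi^*\nabla\psi - (\nabla\psi^*)\psi\right],\\
\frac{\partial\rho}{\partial t} &= -\nabla\cdot\mathbf{j},
\qquad
\mathbf{j} = -\frac{i\hbar}{2m}\left[\psi^*\nabla\psi - (\nabla\psi^*)\psi\right],\\
\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j} &= \frac{2V_I}{\hbar}\,\rho
\qquad (V = V_R + iV_I).
\end{align*}
```

The second line is obtained by multiplying the first equation by ψ*, the second by ψ, and subtracting; the potential terms cancel only because V is real. In the last line a negative V_I acts as a sink of probability, which is the sense in which a complex potential can describe absorption.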







We may intuitively expect that the probability flux $\mathbf{j}$ is related to momentum.
This is indeed the case for $\mathbf{j}$ integrated over all space. From (2.4.16) we obtain

$$\int d^3x\,\mathbf{j}(\mathbf{x},t) = \frac{\langle\mathbf{p}\rangle_t}{m}, \tag{2.4.17}$$

where $\langle\mathbf{p}\rangle_t$ is the expectation value of the momentum operator at time $t$.
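
Relation (2.4.17) is easy to check numerically in one dimension. The sketch below is an illustration under stated assumptions rather than anything from the original text: it takes units ħ = m = 1 and a normalized Gaussian packet with mean wave number k0 = 2 (both arbitrary choices), evaluates the flux (ħ/m) Im(ψ* dψ/dx) by a central difference, and integrates it with the trapezoidal rule. All function names are hypothetical.

```python
import cmath
import math

HBAR = 1.0   # assumed units
MASS = 1.0   # assumed units
K0 = 2.0     # mean wave number of the packet (illustrative choice)


def gaussian_packet(x, k0=K0, sigma=1.0):
    """Normalized 1D Gaussian wave packet with mean momentum hbar * k0."""
    norm = (math.pi * sigma**2) ** -0.25
    return norm * math.exp(-x * x / (2.0 * sigma**2)) * cmath.exp(1j * k0 * x)


def flux(psi, x, dx=1e-4):
    """j(x) = (hbar/m) Im(psi* dpsi/dx), with a central-difference derivative."""
    dpsi = (psi(x + dx) - psi(x - dx)) / (2.0 * dx)
    return (HBAR / MASS) * (psi(x).conjugate() * dpsi).imag


def integrate(f, a, b, n=4000):
    """Composite trapezoidal rule on [a, b]."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, n))
    return total * h


# Left-hand side of (2.4.17): the flux integrated over all (here, enough) space.
total_flux = integrate(lambda x: flux(gaussian_packet, x), -10.0, 10.0)
# Right-hand side: <p>/m for this packet is hbar * k0 / m.
mean_p_over_m = HBAR * K0 / MASS
```

The two numbers `total_flux` and `mean_p_over_m` should agree to within the discretization error of the derivative and the quadrature.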

Equation (2.4.15) is reminiscent of the continuity equation in fluid dynamics
that characterizes a hydrodynamic flow of a fluid in a source-free, sink-free region.
Indeed, historically Schrödinger was first led to interpret $|\psi|^2$ as the actual matter
density, or $e|\psi|^2$ as the actual electric charge density. If we adopt such a view, we
are led to face some bizarre consequences.

A typical argument for a position measurement might go as follows. An atomic
electron is to be regarded as a continuous distribution of matter filling up a finite
region of space around the nucleus; yet, when a measurement is made to make
sure that the electron is at some particular point, this continuous distribution of
matter suddenly shrinks to a point-like particle with no spatial extension. The
more satisfactory statistical interpretation of $|\psi|^2$ as the probability density was
first given by M. Born.

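To understand the physical significance of the wave function, let us write it as

$$\psi(\mathbf{x},t) = \sqrt{\rho(\mathbf{x},t)}\,\exp\left[\frac{iS(\mathbf{x},t)}{\hbar}\right], \tag{2.4.18}$$

with $S$ real and $\rho > 0$, which can always be done for any complex function of $\mathbf{x}$ and
$t$. The meaning of $\rho$ has already been given. What is the physical interpretation of
$S$? Noting

$$\psi^*\nabla\psi = \sqrt{\rho}\,\nabla(\sqrt{\rho}) + \left(\frac{i}{\hbar}\right)\rho\nabla S, \tag{2.4.19}$$

we can write the probability flux as [see (2.4.16)]

$$\mathbf{j} = \frac{\rho\nabla S}{m}. \tag{2.4.20}$$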



We now see that there is more to the wave function than the fact that $|\psi|^2$ is the
probability density; the gradient of the phase $S$ contains a vital piece of
information. From (2.4.20) we see that the spatial variation of the phase of the wave
function characterizes the probability flux; the stronger the phase variation, the
more intense the flux. The direction of $\mathbf{j}$ at some point $\mathbf{x}$ is seen to be normal to
the surface of a constant phase that goes through that point. In the particularly
simple example of a plane wave (a momentum eigenfunction),

$$\psi(\mathbf{x},t) \propto \exp\left(\frac{i\mathbf{p}\cdot\mathbf{x}}{\hbar} - \frac{iEt}{\hbar}\right), \tag{2.4.21}$$

where $\mathbf{p}$ stands for the eigenvalue of the momentum operator. All this is evident
because

$$\nabla S = \mathbf{p}. \tag{2.4.22}$$







More generally, it is tempting to regard $\nabla S/m$ as some kind of “velocity,”

$$\text{“}\mathbf{v}\text{”} = \frac{\nabla S}{m}, \tag{2.4.23}$$

and to write the continuity equation (2.4.15) as

$$\frac{\partial \rho}{\partial t} + \nabla\cdot(\rho\,\text{“}\mathbf{v}\text{”}) = 0, \tag{2.4.24}$$

just as in fluid dynamics. However, we would like to caution the reader against
too literal an interpretation of $\mathbf{j}$ as $\rho$ times the velocity defined at every point in
space, because a simultaneous precision measurement of position and velocity
would necessarily violate the uncertainty principle.



The Classical Limit 

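We now discuss the classical limit of wave mechanics. First, we substitute $\psi$
written in form (2.4.18) into both sides of the time-dependent wave equation.
Straightforward differentiations lead to

$$-\left(\frac{\hbar^2}{2m}\right)\left[\nabla^2\sqrt{\rho} + \left(\frac{2i}{\hbar}\right)(\nabla\sqrt{\rho})\cdot(\nabla S) - \left(\frac{1}{\hbar^2}\right)\sqrt{\rho}\,|\nabla S|^2 + \left(\frac{i}{\hbar}\right)\sqrt{\rho}\,\nabla^2 S\right] + \sqrt{\rho}\,V = i\hbar\left[\frac{\partial\sqrt{\rho}}{\partial t} + \left(\frac{i}{\hbar}\right)\sqrt{\rho}\,\frac{\partial S}{\partial t}\right]. \tag{2.4.25}$$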



So far everything has been exact. Let us suppose now that $\hbar$ can, in some sense, be
regarded as a small quantity. The precise physical meaning of this approximation,
to which we will come back later, is not evident now, but let us assume

$$\hbar|\nabla^2 S| \ll |\nabla S|^2, \tag{2.4.26}$$

and so forth. We can then collect terms in (2.4.25) that do not explicitly contain $\hbar$
to obtain a nonlinear partial differential equation for $S$:

$$\frac{1}{2m}|\nabla S(\mathbf{x},t)|^2 + V(\mathbf{x}) + \frac{\partial S(\mathbf{x},t)}{\partial t} = 0. \tag{2.4.27}$$

We recognize this to be the Hamilton-Jacobi equation in classical mechanics,
first written in 1836, where $S(\mathbf{x},t)$ stands for Hamilton’s principal function. So, not
surprisingly, in the $\hbar \to 0$ limit, classical mechanics is contained in Schrödinger’s
wave mechanics. We have a semiclassical interpretation of the phase of the wave
function: $\hbar$ times the phase is equal to Hamilton’s principal function, provided
that $\hbar$ can be regarded as a small quantity.
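
As a supplementary remark (a sketch added here, not part of the original derivation), it can help to see where ħ sits when the substitution of (2.4.18) into the time-dependent wave equation is split into real and imaginary parts. Dividing out the common phase factor and writing ρ for |ψ|², one finds:

```latex
\begin{align*}
\text{Real part:}\quad
 &\frac{\partial S}{\partial t} + \frac{1}{2m}|\nabla S|^2 + V
   = \frac{\hbar^2}{2m}\,\frac{\nabla^2\sqrt{\rho}}{\sqrt{\rho}},\\
\text{Imaginary part:}\quad
 &\frac{\partial\sqrt{\rho}}{\partial t}
   + \frac{1}{m}\,(\nabla\sqrt{\rho})\cdot(\nabla S)
   + \frac{1}{2m}\,\sqrt{\rho}\,\nabla^2 S = 0
 \;\;\Longleftrightarrow\;\;
 \frac{\partial\rho}{\partial t} + \nabla\cdot\left(\frac{\rho\,\nabla S}{m}\right) = 0 .
\end{align*}
```

The imaginary part is the continuity equation with flux ρ∇S/m, and it holds exactly. The real part differs from the Hamilton-Jacobi equation only by the term on its right-hand side, which is of order ħ² and is the piece discarded in the semiclassical limit.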







Let us now look at a stationary state with time dependence $\exp(-iEt/\hbar)$. This
time dependence is anticipated from the fact that for a classical system with a
constant Hamiltonian, Hamilton’s principal function $S$ is separable:

$$S(\mathbf{x},t) = W(\mathbf{x}) - Et, \tag{2.4.28}$$

where $W(\mathbf{x})$ is called Hamilton’s characteristic function (Goldstein 2002,
pp. 440-44). As time goes on, a surface of a constant $S$ advances in much the same
way as a surface of a constant phase in wave optics (a “wave front”) advances.
The momentum in the classical Hamilton-Jacobi theory is given by

$$\mathbf{P}_{\text{class}} = \nabla S = \nabla W, \tag{2.4.29}$$

which is consistent with our earlier identification of $\nabla S/m$ with some kind of
velocity. In classical mechanics the velocity vector is tangential to the particle
trajectory, and as a result we can trace the trajectory by following continuously the
direction of the velocity vector. The particle trajectory is like a ray in geometric
optics because the $\nabla S$ that traces the trajectory is normal to the wave front defined
by a constant $S$. In this sense, geometrical optics is to wave optics what classical
mechanics is to wave mechanics.

One might wonder, in hindsight, why this optical-mechanical analogy was not
fully exploited in the nineteenth century. The reason is that there was no
motivation for regarding Hamilton’s principal function as the phase of some traveling
wave; the wave nature of a material particle did not become apparent until the
1920s. Besides, the basic unit of action $\hbar$, which must enter into (2.4.18) for
dimensional reasons, was missing in the physics of the nineteenth century.



2.5 ■ ELEMENTARY SOLUTIONS TO SCHRÖDINGER'S
WAVE EQUATION

It is both instructive and useful to look at some relatively elementary solutions
to (2.4.11) for particular choices of the potential-energy function $V(\mathbf{x})$. In this
section we choose some examples that illustrate contemporary physics and/or will
be useful in later chapters of this textbook.

Free Particle in Three Dimensions 

The case $V(\mathbf{x}) = 0$ has fundamental significance. We will consider the solution
to Schrödinger’s equation here in three dimensions using Cartesian coordinates.
The solution in spherical coordinates will be left until our treatment of angular
momentum is presented in the next chapter. Equation (2.4.11) becomes

$$\nabla^2 u_E(\mathbf{x}) = -\frac{2mE}{\hbar^2}u_E(\mathbf{x}). \tag{2.5.1}$$

Define a vector $\mathbf{k}$ where

$$k^2 = k_x^2 + k_y^2 + k_z^2 = \frac{2mE}{\hbar^2}, \tag{2.5.2}$$







that is, $\mathbf{p} = \hbar\mathbf{k}$. Differential equation (2.5.1) is easily solved using the technique
known as “separation of variables.” Writing

$$u_E(\mathbf{x}) = u_x(x)u_y(y)u_z(z), \tag{2.5.3}$$

we arrive at

$$\left[\frac{1}{u_x}\frac{d^2u_x}{dx^2} + k_x^2\right] + \left[\frac{1}{u_y}\frac{d^2u_y}{dy^2} + k_y^2\right] + \left[\frac{1}{u_z}\frac{d^2u_z}{dz^2} + k_z^2\right] = 0. \tag{2.5.4}$$

This leads to individual plane-wave solutions $u_w(w) = c_w e^{ik_w w}$ for $w = x, y, z$.
Note that one gets the same energy $E$ for values $\pm k_w$.

Collecting these solutions and combining the normalization constants, we obtain

$$u_E(\mathbf{x}) = c_x c_y c_z e^{ik_x x + ik_y y + ik_z z} = Ce^{i\mathbf{k}\cdot\mathbf{x}}. \tag{2.5.5}$$

The normalization constant $C$ presents the usual difficulties, which are generally
handled by using a $\delta$-function normalization condition. It is convenient in many
cases, however, to use a “big box” normalization, where all space is contained
within a cube of side length $L$. We impose periodic boundary conditions on the
box and thereby obtain a finite normalization constant $C$. For any real calculation,
we simply let the size $L \to \infty$ at the end of the calculation.

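Imposing the condition $u_x(x + L) = u_x(x)$, we have $k_x L = 2\pi n_x$, where $n_x$ is
an integer. That is,

$$k_x = \frac{2\pi}{L}n_x, \qquad k_y = \frac{2\pi}{L}n_y, \qquad k_z = \frac{2\pi}{L}n_z, \tag{2.5.6}$$

and the normalization criterion becomes

$$1 = \int_0^L dx\int_0^L dy\int_0^L dz\,u_E^*(\mathbf{x})u_E(\mathbf{x}) = L^3|C|^2, \tag{2.5.7}$$

in which case $C = 1/L^{3/2}$ and

$$u_E(\mathbf{x}) = \frac{1}{L^{3/2}}e^{i\mathbf{k}\cdot\mathbf{x}}. \tag{2.5.8}$$

The energy eigenvalue is

$$E = \frac{\mathbf{p}^2}{2m} = \frac{\hbar^2\mathbf{k}^2}{2m} = \frac{\hbar^2}{2m}\left(\frac{2\pi}{L}\right)^2\left(n_x^2 + n_y^2 + n_z^2\right). \tag{2.5.9}$$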



The sixfold degeneracy we mentioned earlier corresponds to the six combinations
of $(\pm n_x, \pm n_y, \pm n_z)$, but the degeneracy can actually be much larger since, in
some cases, there are various combinations of $n_x$, $n_y$, and $n_z$ that can give the
same $E$. In fact, in the (realistic) limit where $L$ is very large, there can be a large
number of states $N$ that have an energy between $E$ and $E + dE$. This “density
of states” $dN/dE$ is an important quantity for calculations of processes that
include free particles. See, for example, the discussion of the photoelectric effect in
Section 5.8.

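To calculate the density of states, imagine a spherical shell in $\mathbf{k}$ space with
radius $|\mathbf{k}| = 2\pi|\mathbf{n}|/L$ and thickness $d|\mathbf{k}| = 2\pi\,d|\mathbf{n}|/L$. All states within this
shell have energy $E = \hbar^2\mathbf{k}^2/2m$. The number of states $dN$ within this shell is
$4\pi\mathbf{n}^2\,d|\mathbf{n}|$. Therefore,

$$\frac{dN}{dE} = \frac{4\pi\mathbf{n}^2\,d|\mathbf{n}|}{\hbar^2|\mathbf{k}|\,d|\mathbf{k}|/m} = \frac{4\pi}{\hbar^2}\,m\left(\frac{L}{2\pi}\right)^2|\mathbf{k}|\,\frac{L}{2\pi} = \frac{m^{3/2}E^{1/2}L^3}{\sqrt{2}\,\pi^2\hbar^3}. \tag{2.5.10}$$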
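
The counting behind (2.5.10) can be checked by brute force. The sketch below is an added illustration with hypothetical function names: it counts the integer triples (n_x, n_y, n_z) inside a sphere of radius |n| and compares the count with the integral of (2.5.10) from 0 to E, which reduces to (4π/3)|n|³ once E is expressed through |n| = L(2mE)^{1/2}/(2πħ).

```python
import math


def count_states(n_max):
    """Count integer triples (nx, ny, nz) with nx^2 + ny^2 + nz^2 <= n_max^2."""
    r2 = n_max * n_max
    count = 0
    for nx in range(-n_max, n_max + 1):
        for ny in range(-n_max, n_max + 1):
            rem = r2 - nx * nx - ny * ny
            if rem >= 0:
                # nz runs over -floor(sqrt(rem)) .. +floor(sqrt(rem)).
                count += 2 * math.isqrt(rem) + 1
    return count


def integrated_density(n_max):
    """Integral of dN/dE from (2.5.10) up to the energy with |n| = n_max."""
    return 4.0 * math.pi * n_max**3 / 3.0


if __name__ == "__main__":
    for n_max in (5, 10, 30):
        exact = count_states(n_max)
        smooth = integrated_density(n_max)
        print(n_max, exact, round(smooth, 1), round(exact / smooth, 4))
```

The ratio of the exact count to the smooth estimate approaches 1 as |n| grows, which is the sense in which the continuum density of states becomes accurate for a large box.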



In a typical “real” calculation, the density of states will be multiplied by some
probability that involves $u_E^*(\mathbf{x})u_E(\mathbf{x})$. In this case, the factors of $L^3$ will cancel
explicitly, so the limit $L \to \infty$ is trivial. This “big box” normalization also yields
the correct answer for the probability flux. Rewriting (2.4.21) with this
normalization, we have

$$\psi(\mathbf{x},t) = \frac{1}{L^{3/2}}\exp\left(\frac{i\mathbf{p}\cdot\mathbf{x}}{\hbar} - \frac{iEt}{\hbar}\right), \tag{2.5.11}$$

in which case we find

$$\mathbf{j}(\mathbf{x},t) = \frac{\hbar}{m}\mathrm{Im}(\psi^*\nabla\psi) = \frac{\hbar\mathbf{k}}{m}\frac{1}{L^3} = \mathbf{v}\rho, \tag{2.5.12}$$

where $\rho = 1/L^3$ is indeed the probability density.



The Simple Harmonic Oscillator 

In Section 2.3 we saw an elegant solution for the case $V(x) = m\omega^2x^2/2$ that
yielded the energy eigenvalues, eigenstates, and wave functions. Here, we
demonstrate a different approach that solves the differential equation

$$-\frac{\hbar^2}{2m}\frac{d^2}{dx^2}u_E(x) + \frac{1}{2}m\omega^2x^2u_E(x) = Eu_E(x). \tag{2.5.13}$$

Our approach will introduce the concept of generating functions, a generally
useful technique that arises in many treatments of differential eigenvalue problems.

First, transform (2.5.13) using the dimensionless position $y \equiv x/x_0$, where
$x_0 \equiv \sqrt{\hbar/m\omega}$. Also introduce a dimensionless energy variable $\varepsilon \equiv 2E/\hbar\omega$. The
differential equation we need to solve therefore becomes

$$\frac{d^2}{dy^2}u(y) + (\varepsilon - y^2)u(y) = 0. \tag{2.5.14}$$

For $y \to \pm\infty$, the solution must tend to zero; otherwise the wave function
will not be normalizable and hence unphysical. The differential equation
$w''(y) - y^2w(y) = 0$ has solutions that behave as $w(y) \propto \exp(\pm y^2/2)$ for large $|y|$, so we have to choose the
minus sign. We then “remove” the asymptotic behavior of the wave function by
writing

$$u(y) = h(y)e^{-y^2/2}, \tag{2.5.15}$$

where the function $h(y)$ satisfies the differential equation

$$\frac{d^2h}{dy^2} - 2y\frac{dh}{dy} + (\varepsilon - 1)h(y) = 0. \tag{2.5.16}$$



To this point, we have followed the traditional solution of the simple harmonic
oscillator as found in many textbooks. Typically, one would now look for a
series solution for $h(y)$ and discover that a normalizable solution is possible only
if the series terminates. (In fact, we use this approach for the three-dimensional
isotropic harmonic oscillator in this book. See Section 3.7.) One forces this
termination by imposing the condition that $\varepsilon - 1$ be an even, nonnegative integer
$2n$, $n = 0, 1, 2, \ldots$. The solutions are then written using the resulting polynomials
$h_n(y)$. Of course, $\varepsilon - 1 = 2n$ is equivalent to $E_n = \left(n + \frac{1}{2}\right)\hbar\omega$, the quantization
relation (2.3.22).

Let us take a different approach. Consider the “Hermite polynomials” $H_n(x)$
defined by the “generating function” $g(x,t)$ through

$$g(x,t) \equiv e^{-t^2 + 2tx} \tag{2.5.17a}$$

$$\equiv \sum_{n=0}^{\infty} H_n(x)\frac{t^n}{n!}. \tag{2.5.17b}$$

Some properties of the $H_n(x)$ are immediately obvious. For example, $H_0(x) = 1$.
Also, because

$$g(0,t) = e^{-t^2} = \sum_{n=0}^{\infty}\frac{(-1)^n}{n!}t^{2n}, \tag{2.5.18}$$

it is clear that $H_n(0) = 0$ if $n$ is odd, since this series involves only even powers
of $t$. On the other hand, if we restrict ourselves to even values of $n$, we have

$$g(0,t) = e^{-t^2} = \sum_{n=0}^{\infty}\frac{(-1)^{n/2}}{(n/2)!}t^n = \sum_{n=0}^{\infty}\frac{(-1)^{n/2}}{(n/2)!}\frac{n!}{n!}t^n, \tag{2.5.19}$$

and so $H_n(0) = (-1)^{n/2}n!/(n/2)!$. Also, since $g(-x,t)$ reverses the sign only on
terms with odd powers of $t$, $H_n(-x) = (-1)^nH_n(x)$.

We can take derivatives of $g(x,t)$ to build the Hermite polynomials using
recursion relations between them and their derivatives. The trick is that we can
differentiate the analytic form of the generating function (2.5.17a) or the series
form (2.5.17b) and then compare results. For example, if we take the derivative
using (2.5.17a), then

$$\frac{\partial g}{\partial x} = 2tg(x,t) = \sum_{n=0}^{\infty}2H_n(x)\frac{t^{n+1}}{n!} = \sum_{n=0}^{\infty}2(n+1)H_n(x)\frac{t^{n+1}}{(n+1)!}, \tag{2.5.20}$$

where we insert the series definition of the generating function after taking the
derivative. On the other hand, we can take the derivative of (2.5.17b) directly, in
which case



$$\frac{\partial g}{\partial x} = \sum_{n=0}^{\infty}H_n'(x)\frac{t^n}{n!}. \tag{2.5.21}$$

Comparing (2.5.20) and (2.5.21) shows that

$$H_n'(x) = 2nH_{n-1}(x). \tag{2.5.22}$$

This is enough information for us to build the Hermite polynomials:

$H_0(x) = 1$
so $H_1'(x) = 2$, therefore $H_1(x) = 2x$
so $H_2'(x) = 8x$, therefore $H_2(x) = 4x^2 - 2$
so $H_3'(x) = 24x^2 - 12$, therefore $H_3(x) = 8x^3 - 12x$
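
The construction just described, integrating (2.5.22) and fixing the integration constant with the known value of H_n(0), can be automated. The following sketch is an added illustration: the coefficient-list representation and the function names are choices made here, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction
from math import factorial


def hermite_at_zero(n):
    """H_n(0): zero for odd n, (-1)^(n/2) n! / (n/2)! for even n."""
    if n % 2:
        return Fraction(0)
    return Fraction((-1) ** (n // 2) * factorial(n), factorial(n // 2))


def next_hermite(prev, n):
    """Build H_n from H_{n-1} by integrating H_n' = 2 n H_{n-1}.

    Polynomials are coefficient lists [c0, c1, ...] in increasing powers of x.
    """
    # Term-by-term antiderivative of 2 n H_{n-1}: c_k x^k -> 2 n c_k x^(k+1) / (k+1).
    integrated = [2 * n * c / (k + 1) for k, c in enumerate(prev)]
    # The constant of integration is fixed by the value H_n(0).
    return [hermite_at_zero(n)] + integrated


def hermite_table(n_max):
    """Return [H_0, H_1, ..., H_n_max] as coefficient lists."""
    polys = [[Fraction(1)]]
    for n in range(1, n_max + 1):
        polys.append(next_hermite(polys[-1], n))
    return polys


if __name__ == "__main__":
    for n, poly in enumerate(hermite_table(5)):
        print(n, [int(c) for c in poly])
```

The first four entries of `hermite_table` reproduce the polynomials listed above; for example, the entry for n = 3 is the coefficient list of 8x³ − 12x.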



So far, this is just a curious mathematical exercise. To see why it is relevant to
the simple harmonic oscillator, consider the derivative of the generating function
with respect to $t$. If we start with (2.5.17a), then

$$\frac{\partial g}{\partial t} = -2tg(x,t) + 2xg(x,t) = -\sum_{n=0}^{\infty}2H_n(x)\frac{t^{n+1}}{n!} + \sum_{n=0}^{\infty}2xH_n(x)\frac{t^n}{n!} = -\sum_{n=0}^{\infty}2nH_{n-1}(x)\frac{t^n}{n!} + \sum_{n=0}^{\infty}2xH_n(x)\frac{t^n}{n!}. \tag{2.5.23}$$

Or, if we differentiate (2.5.17b), then we have

$$\frac{\partial g}{\partial t} = \sum_{n=0}^{\infty}nH_n(x)\frac{t^{n-1}}{n!} = \sum_{n=0}^{\infty}H_{n+1}(x)\frac{t^n}{n!}. \tag{2.5.24}$$

Comparing (2.5.23) and (2.5.24) gives us the recursion relation

$$H_{n+1}(x) = 2xH_n(x) - 2nH_{n-1}(x), \tag{2.5.25}$$







which we combine with (2.5.22) to find

$$H_n''(x) = 2n\cdot 2(n-1)H_{n-2}(x) = 2n\left[2xH_{n-1}(x) - H_n(x)\right] = 2xH_n'(x) - 2nH_n(x). \tag{2.5.26}$$

In other words, the Hermite polynomials satisfy the differential equation

$$H_n''(x) - 2xH_n'(x) + 2nH_n(x) = 0, \tag{2.5.27}$$

where $n$ is a nonnegative integer. This, however, is the same as the Schrödinger
equation written as (2.5.16) since $\varepsilon - 1 = 2n$. That is, the wave functions for the
simple harmonic oscillator are given by

$$u_n(x) = c_nH_n\left(x\sqrt{\frac{m\omega}{\hbar}}\right)e^{-m\omega x^2/2\hbar} \tag{2.5.28}$$

up to some normalization constant $c_n$. This constant can be determined from the
orthogonality relationship

$$\int_{-\infty}^{\infty}H_n(x)H_m(x)e^{-x^2}dx = \pi^{1/2}2^nn!\,\delta_{nm}, \tag{2.5.29}$$



which is easily proved using the generating function. See Problem 2.21 at the end 
of this chapter. 
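
Before attempting the proof, the orthogonality relation (2.5.29) can be checked numerically. The sketch below is an added illustration with hypothetical function names; it evaluates H_n(x) with the recursion (2.5.25) and estimates the integral with a plain trapezoidal rule on a finite interval, which is adequate because the Gaussian weight makes the integrand negligible beyond |x| of about 10.

```python
import math


def hermite(n, x):
    """Evaluate H_n(x) with the recursion H_{k+1} = 2x H_k - 2k H_{k-1} (2.5.25)."""
    h_prev, h = 0.0, 1.0  # h holds H_0 = 1; h_prev is multiplied by k = 0 first
    for k in range(n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h


def overlap(n, m, x_max=10.0, steps=20000):
    """Trapezoidal estimate of the integral in (2.5.29) over [-x_max, x_max]."""
    dx = 2.0 * x_max / steps
    total = 0.0
    for i in range(steps + 1):
        x = -x_max + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * hermite(n, x) * hermite(m, x) * math.exp(-x * x)
    return total * dx


def expected(n, m):
    """Right-hand side of (2.5.29)."""
    if n != m:
        return 0.0
    return math.sqrt(math.pi) * 2**n * math.factorial(n)


if __name__ == "__main__":
    for n, m in [(0, 0), (2, 2), (3, 3), (1, 3), (0, 2)]:
        print(n, m, overlap(n, m), expected(n, m))
```

A numerical check of this kind is no substitute for the proof, but it is a quick way to catch a sign or normalization slip when computing the constants c_n.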

Generating functions are useful far beyond our limited application
here. Among other things, many of the orthogonal polynomials that arise
from solving the Schrödinger equation for different potentials can be derived from
generating functions. See, for example, Problem 3.22 in Chapter 3. The interested
reader is encouraged to pursue this further from any one of the many excellent
texts on mathematical physics.



The Linear Potential 

Perhaps the first potential-energy function, with bound states, to come to mind is
the linear potential, namely

$$V(x) = k|x|, \tag{2.5.30}$$

where $k$ is an arbitrary positive constant. Given a total energy $E$, this potential
has a classical turning point at a value $x = a$, where $E = ka$. This point will be
important for understanding the quantum behavior of a particle of mass $m$ bound
by this potential.







The Schrödinger equation becomes

$$-\frac{\hbar^2}{2m}\frac{d^2u_E}{dx^2} + k|x|u_E(x) = Eu_E(x). \tag{2.5.31}$$

It is easiest to deal with the absolute value by restricting our attention to $x \ge 0$. We
can do this because $V(-x) = V(x)$, so there are two types of solutions, namely
$u_E(-x) = \pm u_E(x)$. In either case, we need $u_E(x)$ to tend toward zero as $x \to \infty$.
If $u_E(-x) = -u_E(x)$, then we need $u_E(0) = 0$. On the other hand, if $u_E(-x) =
+u_E(x)$, then we have $u_E'(0) = 0$, because $u_E(\epsilon) - u_E(-\epsilon) \equiv 0$, even for $\epsilon \to 0$.
(As we will discuss in Chapter 4, we refer to these solutions as “odd” and “even”
parity.)

Once again, we write the differential equation in terms of dimensionless
variables, based on appropriate scales for length and energy. In this case, the
length scale is $x_0 = (\hbar^2/mk)^{1/3}$ and the energy scale is
$E_0 = kx_0 = (\hbar^2k^2/m)^{1/3}$. Defining $y \equiv x/x_0$ and $\varepsilon \equiv E/E_0$ enables us to rewrite
(2.5.31) as

$$\frac{d^2u_E}{dy^2} - 2(y - \varepsilon)u_E(y) = 0, \qquad y \ge 0. \tag{2.5.32}$$

Notice that $y = \varepsilon$ when $x = E/k$, that is, at the classical turning point $x = a$. In fact,
when we define a translated position variable $z \equiv 2^{1/3}(y - \varepsilon)$, (2.5.32) becomes

$$\frac{d^2u_E}{dz^2} - zu_E(z) = 0. \tag{2.5.33}$$



This is the Airy equation, and the solution is the Airy function Ai($z$) plotted in
Figure 2.3. The Airy function has a peculiar behavior, oscillatory for negative
values of the argument and decreasing rapidly toward zero for positive values. Of
course, this is exactly the behavior we expect for the wave function, since $z = 0$
is the classical turning point.

FIGURE 2.3 The Airy function. [Plot of Ai($z$) against $z$ not reproduced.]







Note that the boundary conditions at $x = 0$ translate into zeros for either Ai$'(z)$
or Ai$(z)$, where $z = -2^{1/3}\varepsilon$. In other words, the zeros of the Airy function or its
derivative determine the quantized energies. One finds that

Ai$'(z) = 0$ for $z = -1.019, -3.248, -4.820, \ldots$ (even), (2.5.34)

Ai$(z) = 0$ for $z = -2.338, -4.088, -5.521, \ldots$ (odd). (2.5.35)

For example, the ground-state energy is $E = (1.019/2^{1/3})(\hbar^2k^2/m)^{1/3}$.
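
The zeros quoted in (2.5.34) and (2.5.35) can be reproduced with nothing more than the power series for the Airy function. The sketch below is an added illustration, with hypothetical function names. It assumes the standard Maclaurin construction Ai(z) = c₁f(z) − c₂g(z), where f and g are the solutions of u″ = zu with f(0) = 1, f′(0) = 0 and g(0) = 0, g′(0) = 1, and c₁ = Ai(0), c₂ = −Ai′(0). Zeros are bracketed by scanning and refined by bisection.

```python
import math

# Ai(0) and -Ai'(0): the two constants that fix the Airy function.
C1 = 3.0 ** (-2.0 / 3.0) / math.gamma(2.0 / 3.0)
C2 = 3.0 ** (-1.0 / 3.0) / math.gamma(1.0 / 3.0)


def airy(z, terms=60):
    """Return (Ai(z), Ai'(z)) from the Maclaurin series; adequate for |z| <~ 6."""
    z3 = z ** 3
    f, g, df, dg = 1.0, z, 0.0, 1.0   # running sums of f, g, f', g'
    tf, tg = 1.0, z                   # current terms of f and g
    tdf, tdg = 0.0, 1.0               # current terms of f' and g'
    for k in range(1, terms):
        tdf = z * z / 2.0 if k == 1 else tdf * z3 / ((3 * k - 3) * (3 * k - 1))
        tdg *= z3 / ((3 * k - 2) * (3 * k))
        tf *= z3 / ((3 * k - 1) * (3 * k))
        tg *= z3 / ((3 * k) * (3 * k + 1))
        f, g, df, dg = f + tf, g + tg, df + tdf, dg + tdg
    return C1 * f - C2 * g, C1 * df - C2 * dg


def zeros(component, z_min=-6.0, step=0.01):
    """Zeros of Ai (component=0) or Ai' (component=1) between z_min and 0."""
    found = []
    a = 0.0
    fa = airy(a)[component]
    while a > z_min:
        b = a - step
        fb = airy(b)[component]
        if fa * fb < 0.0:
            lo, hi, flo = a, b, fa
            for _ in range(60):           # bisection inside the bracket
                mid = 0.5 * (lo + hi)
                fmid = airy(mid)[component]
                if flo * fmid <= 0.0:
                    hi = mid
                else:
                    lo, flo = mid, fmid
            found.append(0.5 * (lo + hi))
        a, fa = b, fb
    return found


def energies(component):
    """Energies in units of (hbar^2 k^2 / m)^(1/3), from epsilon = -z / 2^(1/3)."""
    return [-z / 2.0 ** (1.0 / 3.0) for z in zeros(component)]


if __name__ == "__main__":
    print("even:", [round(z, 3) for z in zeros(1)], energies(1))
    print("odd: ", [round(z, 3) for z in zeros(0)], energies(0))
```

A library routine for the Airy function would normally be preferable; the series is used here only to keep the example free of dependencies, and it loses accuracy for arguments much larger in magnitude than those needed for the first few levels.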

The quantum-theoretical treatment of the linear potential may appear to have
little to do with the real world. It turns out, however, that a potential of type
(2.5.30) is actually of practical interest in studying the energy spectrum of a
quark-antiquark bound system called quarkonium. In this case, the $x$ in (2.5.30) is
replaced by the quark-antiquark separation distance $r$. This constant $k$ is empirically
estimated to be in the neighborhood of

$$1\ \text{GeV/fm} \simeq 1.6 \times 10^5\ \text{N}, \tag{2.5.36}$$

which corresponds to a gravitational force of about 16 tons.

Indeed, another real-world example of the linear potential is the “bouncing
ball.” One interprets (2.5.30) as the potential energy of a ball of mass $m$ at a height
$x$ above the floor, and $k = mg$, where $g$ is the local acceleration due to gravity. Of
course, this is the potential energy only for $x \ge 0$ as there is an infinite potential
barrier that causes the ball to “bounce.” Quantum-mechanically, this means that
only the odd parity solutions (2.5.35) are allowed.

The bouncing ball happens to be one of those rare cases where
quantum-mechanical effects can be observed macroscopically. The trick is to have a very
low-mass “ball,” which has been achieved with neutrons by a group* working
at the Institut Laue-Langevin (ILL) in Grenoble, France. For neutrons with $m =
1.68 \times 10^{-27}$ kg, the characteristic length scale is $x_0 = (\hbar^2/m^2g)^{1/3} = 7.40\ \mu$m.
The “allowed heights” to which a neutron can bounce are $(2.338/2^{1/3})x_0 =
14\ \mu$m, $(4.088/2^{1/3})x_0 = 24\ \mu$m, $(5.521/2^{1/3})x_0 = 32\ \mu$m, and so on. These heights are
small, but they are measurable with precision mechanical devices and very low-energy
(aka “ultracold”) neutrons. The experimenters’ results are shown in Figure 2.4.
Plotted is the detected neutron rate as a function of the height of a slit that allows
neutrons to pass only if they exceed this height. No neutrons are observed unless
the height is at least $\approx 14\ \mu$m, and clear breaks are observed at $\approx 24\ \mu$m and
$\approx 32\ \mu$m, in excellent agreement with the predictions of quantum mechanics.
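
The numbers in this paragraph follow from a few lines of arithmetic. The sketch below is an added illustration: the neutron mass is the value quoted in the text, while the value of ħ and the choice g = 9.81 m/s² are assumptions made here, and the function names are hypothetical.

```python
HBAR = 1.054571817e-34   # J s
M_NEUTRON = 1.68e-27     # kg, value quoted in the text
G = 9.81                 # m/s^2, assumed local gravitational acceleration

# Magnitudes of the first three zeros of Ai(z), from (2.5.35).
AIRY_ZEROS = (2.338, 4.088, 5.521)


def length_scale(m=M_NEUTRON, g=G, hbar=HBAR):
    """x0 = (hbar^2 / (m k))^(1/3) with k = m g, in metres."""
    return (hbar**2 / (m**2 * g)) ** (1.0 / 3.0)


def allowed_heights_um():
    """Turning heights E_n / (m g) = (|z_n| / 2^(1/3)) x0, in micrometres."""
    x0 = length_scale()
    return [z / 2.0 ** (1.0 / 3.0) * x0 * 1e6 for z in AIRY_ZEROS]


if __name__ == "__main__":
    print("x0 [um]:", length_scale() * 1e6)
    print("heights [um]:", allowed_heights_um())
```

With these inputs the length scale comes out at roughly 7.4 μm, and the three heights round to the values quoted above; the last digit of x₀ depends on the value adopted for g.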

The WKB (Semiclassical) Approximation 

Having solved the problem of a linear potential, it is worthwhile to introduce an
important approximation technique known as the WKB solution, after G. Wentzel,
A. Kramers, and L. Brillouin.† This technique is based on making use of regions
where the wavelength is much shorter than the typical distance over which the
potential energy varies. Such is never the case near classical turning points, but
this is where the linear potential solution can be used to join the solutions on either
side of them.

FIGURE 2.4 Experimental observation of the quantum-mechanical states of a
bouncing neutron, from V. V. Nesvizhevsky et al., Phys. Rev. D 67 (2003) 102002. The solid
curve is a fit to the data based on classical physics. Note that the vertical scale is
logarithmic. [Plot not reproduced; the horizontal axis is $\Delta h$ in $\mu$m, from 0 to 40.]

*See V. V. Nesvizhevsky et al., Phys. Rev. D 67 (2003) 102002, and V. V. Nesvizhevsky et al.,
Eur. Phys. J. C 40 (2005) 479.

†A similar technique was used earlier by H. Jeffreys; this solution is referred to as the JWKB
solution in some English books.

Again restricting ourselves to one dimension, we write Schrödinger’s wave
equation as

$$\frac{d^2u_E}{dx^2} + \frac{2m}{\hbar^2}(E - V(x))u_E(x) = 0. \tag{2.5.37}$$

Define the quantities

$$k(x) \equiv \left[\frac{2m}{\hbar^2}(E - V(x))\right]^{1/2} \quad \text{for } E > V(x) \text{ and} \tag{2.5.38a}$$

$$k(x) \equiv -i\kappa(x) \equiv -i\left[\frac{2m}{\hbar^2}(V(x) - E)\right]^{1/2} \quad \text{for } E < V(x), \tag{2.5.38b}$$

and so (2.5.37) becomes

$$\frac{d^2u_E}{dx^2} + [k(x)]^2u_E(x) = 0. \tag{2.5.39}$$



Now, if $V(x)$ were not changing with $x$, then $k(x)$ would be a constant, and $u(x) \propto
\exp(\pm ikx)$ would solve (2.5.39). Consequently, if we assume that $V(x)$ varies
only “slowly” with $x$, then we are tempted to try a solution of the form

$$u_E(x) \equiv \exp[iW(x)/\hbar]. \tag{2.5.40}$$

(The reason for including the $\hbar$ will become apparent at the end of this section,
when we discuss the physical interpretation of the WKB approximation.) In this
case, (2.5.39) becomes

$$i\hbar\frac{d^2W}{dx^2} - \left(\frac{dW}{dx}\right)^2 + \hbar^2[k(x)]^2 = 0, \tag{2.5.41}$$

which is completely equivalent to Schrödinger’s equation, although rewritten in
what appears to be a nasty form. However, we consider a solution to this equation
under the condition that

$$\hbar\left|\frac{d^2W}{dx^2}\right| \ll \left|\frac{dW}{dx}\right|^2. \tag{2.5.42}$$



This quantifies our notion of a “slowly varying” potential $V(x)$, and we will return
soon to the physical significance of this condition.

Forging ahead for now, we use the condition (2.5.42) with our differential
equation (2.5.41) to write a lowest-order approximation for $W(x)$, namely

$$W_0'(x) = \pm\hbar k(x), \tag{2.5.43}$$

leading to a first-order approximation for $W(x)$, based on

$$\left(\frac{dW_1}{dx}\right)^2 = \hbar^2[k(x)]^2 + i\hbar W_0''(x) = \hbar^2[k(x)]^2 \pm i\hbar^2k'(x), \tag{2.5.44}$$

where the second term in (2.5.44) is much smaller than the first, so that

$$W(x) \approx W_1(x) = \pm\hbar\int^x dx'\left[k^2(x') \pm ik'(x')\right]^{1/2} \approx \pm\hbar\int^x dx'\,k(x')\left[1 \pm \frac{i}{2}\frac{k'(x')}{k^2(x')}\right] = \pm\hbar\int^x dx'\,k(x') + \frac{i}{2}\hbar\ln[k(x)]. \tag{2.5.45}$$

The WKB approximation for the wave function is given by (2.5.40) and the
first-order approximation (2.5.45) for $W(x)$, namely

$$u_E(x) \approx \exp[iW(x)/\hbar] = \frac{1}{[k(x)]^{1/2}}\exp\left[\pm i\int^x dx'\,k(x')\right]. \tag{2.5.46}$$



Note that this specifies a choice of two solutions ($\pm$) in either the region where
$E > V(x)$, with $k(x)$ from (2.5.38a), or the region where $E < V(x)$, with $k(x)$
from (2.5.38b). Joining these two solutions across the classical turning point is
the next task.

FIGURE 2.5 Schematic diagram for behavior of wave function $u_E(x)$ in potential well
$V(x)$ with turning points $x_1$ and $x_2$. Note the similarity to Figure 2.3 near the turning
points. [Plot not reproduced; the regions are labeled I, II, and III.]

We do not discuss this joining procedure in detail, because it is discussed in
many places (Schiff 1968, pp. 268-76, or Merzbacher 1998, Chapter 7, for
example). Instead, we content ourselves with presenting the results of such an analysis
for a potential well, schematically shown in Figure 2.5, with two turning points, $x_1$
and $x_2$. The wave function must behave like (2.5.46), with $k(x)$ given by (2.5.38a)
in region II and by (2.5.38b) in regions I and III. The solutions in the
neighborhood of the turning points, shown as a dashed line in Figure 2.5, are given by Airy
functions, because we assume a linear approximation to the potential in these
regions. Note that the asymptotic dependences of the Airy function* are

$$\mathrm{Ai}(z) \to \frac{1}{2\sqrt{\pi}}z^{-1/4}\exp\left(-\frac{2}{3}z^{3/2}\right), \qquad z \to +\infty \tag{2.5.47a}$$

$$\mathrm{Ai}(z) \to \frac{1}{\sqrt{\pi}}|z|^{-1/4}\cos\left(\frac{2}{3}|z|^{3/2} - \frac{\pi}{4}\right), \qquad z \to -\infty \tag{2.5.47b}$$



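For connecting regions I and II, the correct linear combination of the two
solutions (2.5.46) is determined by choosing the integration constants in such a way
that

$$\left\{\frac{1}{[V(x) - E]^{1/4}}\right\}\exp\left[-\left(\frac{1}{\hbar}\right)\int_x^{x_1}dx'\sqrt{2m[V(x') - E]}\right] \to \left\{\frac{2}{[E - V(x)]^{1/4}}\right\}\cos\left[\left(\frac{1}{\hbar}\right)\int_{x_1}^{x}dx'\sqrt{2m[E - V(x')]} - \frac{\pi}{4}\right]. \tag{2.5.48}$$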



*There is actually a second Airy function, Bi($z$), which is very similar to Ai($z$) for negative $z$ but grows
without bound as $z \to +\infty$. It is relevant to this discussion, but we are glossing over the details.







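Likewise, from region III into region II we have

$$\left\{\frac{1}{[V(x) - E]^{1/4}}\right\}\exp\left[-\left(\frac{1}{\hbar}\right)\int_{x_2}^{x}dx'\sqrt{2m[V(x') - E]}\right] \to \left\{\frac{2}{[E - V(x)]^{1/4}}\right\}\cos\left[-\left(\frac{1}{\hbar}\right)\int_{x}^{x_2}dx'\sqrt{2m[E - V(x')]} + \frac{\pi}{4}\right]. \tag{2.5.49}$$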



Of course, we must obtain the same form for the wave function in region II,
regardless of which turning point is analyzed. This implies that the arguments of
the cosine in (2.5.48) and (2.5.49) must differ at most by an integer multiple of
$\pi$ [not of $2\pi$, because the signs of both sides of (2.5.49) can be reversed]. In this
way we obtain a very interesting consistency condition:

$$\int_{x_1}^{x_2}dx\sqrt{2m[E - V(x)]} = \left(n + \frac{1}{2}\right)\pi\hbar \qquad (n = 0, 1, 2, 3, \ldots). \tag{2.5.50}$$

Apart from the difference between $n + \frac{1}{2}$ and $n$, this equation is simply the
quantization condition of the old quantum theory that A. Sommerfeld and W. Wilson
originally wrote in 1915 as

$$\oint p\,dq = nh, \tag{2.5.51}$$

where $h$ is Planck’s $h$, not Dirac’s $\hbar$, and the integral is evaluated over one whole
period of classical motion, from $x_1$ to $x_2$ and back.
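
As a quick check of (2.5.50), one can apply it to the harmonic oscillator potential V(x) = mω²x²/2 of (2.5.13). This worked example is added here as an illustration; the symbol a, denoting the turning-point distance with a² = 2E/mω², is introduced only for this purpose.

```latex
\begin{align*}
\int_{-a}^{a} dx\,\sqrt{2m\left[E - \tfrac{1}{2}m\omega^2x^2\right]}
 &= m\omega\int_{-a}^{a} dx\,\sqrt{a^2 - x^2}
  = m\omega\,\frac{\pi a^2}{2}
  = \frac{\pi E}{\omega},\\
\frac{\pi E}{\omega} = \left(n + \tfrac{1}{2}\right)\pi\hbar
 \quad&\Longrightarrow\quad
 E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega .
\end{align*}
```

In this particular case the semiclassical condition reproduces the exact levels, including the zero-point term that the older condition (2.5.51) misses. For a general potential the agreement is only approximate.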

Equation (2.5.50) can be used to obtain approximate expressions for the en- 
ergy levels of a particle confined in a potential well. As an example, we consider 
the energy Spectrum of a ball bouncing up and down over a hard surface, the 
“bouncing neutrons” discussed earlier in this section, namely 



V - 



oo, 



mgx. 



forx > 0 
forx < 0, 



(2.5.52) 



where x stands for the height of the ball measured from the hard surface. One 
might be tempted to use (2.5.50) directly with 



$$x_1 = 0, \qquad x_2 = \frac{E}{mg}, \tag{2.5.53}$$



which are the classical turning points of this problem. We note, however, that
(2.5.50) was derived under the assumption that the WKB wave function “leaks
into” the $x < x_1$ region, while in our problem the wave function must strictly
vanish for $x \le x_1 = 0$. A much more satisfactory approach to this problem is
to consider the odd-parity solutions, those guaranteed to vanish at $x = 0$, of a
modified problem defined by



$$V(x) = mg|x| \qquad (-\infty < x < \infty), \tag{2.5.54}$$







whose turning points are 



$$x_1 = -\frac{E}{mg}, \qquad x_2 = \frac{E}{mg}. \tag{2.5.55}$$



The energy spectrum of the odd-parity states for this modified problem must
clearly be the same as that of the original problem. The quantization condition
then becomes




$$\int_{-E/mg}^{E/mg}dx\,\sqrt{2m(E-mg|x|)} = \left(n_{\text{odd}}+\tfrac{1}{2}\right)\pi\hbar \qquad (n_{\text{odd}} = 1,3,5,\ldots) \tag{2.5.56}$$



or, equivalently,

$$\int_{0}^{E/mg}dx\,\sqrt{2m(E-mgx)} = \left(n-\tfrac{1}{4}\right)\pi\hbar \qquad (n = 1,2,3,4,\ldots). \tag{2.5.57}$$

This integral is elementary, and we obtain



$$E_n = \left\{\frac{\left[3\left(n-\tfrac{1}{4}\right)\pi\right]^{2/3}}{2}\right\}(mg^2\hbar^2)^{1/3} \tag{2.5.58}$$



for the quantized energy levels of the bouncing ball. 

Table 2.2 compares the WKB approximation to the exact solution, using ze- 
ros of the Airy function, for the first 10 energy levels. We see that agreement is 
excellent even for small values of n and is essentially exact for n ~ 10. 
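The WKB column of Table 2.2 can be reproduced directly from (2.5.58). The following is a minimal sketch rather than anything taken from the text: the function name `wkb_energy` and the list `EXACT` are illustrative, and the exact values are simply copied from the table instead of being computed from Airy-function zeros.

```python
import math

# "Exact" column of Table 2.2, copied from the table (not recomputed here),
# in units of (m g^2 hbar^2 / 2)^(1/3).
EXACT = [2.338, 4.088, 5.521, 6.787, 7.944, 9.023, 10.040, 11.009, 11.936, 12.829]


def wkb_energy(n: int) -> float:
    """WKB level n = 1, 2, ... from (2.5.58), in units of (m g^2 hbar^2 / 2)^(1/3).

    In these units (2.5.58) reduces to [3 (n - 1/4) pi / 2]^(2/3).
    """
    return (1.5 * (n - 0.25) * math.pi) ** (2.0 / 3.0)


# Print n, the WKB value, the tabulated exact value, and the relative difference.
for n, exact in enumerate(EXACT, start=1):
    wkb = wkb_energy(n)
    print(f"{n:2d}  {wkb:7.3f}  {exact:7.3f}  {100.0 * (exact - wkb) / exact:6.3f}%")
```

The relative difference shrinks steadily with n, which is the behavior the paragraph above describes.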

Before concluding, let us return to the interpretation of the condition (2.5.42).
It is exact in the case $\hbar \to 0$, which suggests a connection between the WKB
approximation and the classical limit. In fact, when we use (2.5.40), the
time-dependent wave function becomes

$$\psi(x,t) \propto u_E(x)\exp(-iEt/\hbar) = \exp\left(iW(x)/\hbar - iEt/\hbar\right). \tag{2.5.59}$$



TABLE 2.2 The Quantized Energies of a Bouncing Ball in Units of $(mg^2\hbar^2/2)^{1/3}$

  n      WKB       Exact
  1      2.320     2.338
  2      4.082     4.088
  3      5.517     5.521
  4      6.784     6.787
  5      7.942     7.944
  6      9.021     9.023
  7     10.039    10.040
  8     11.008    11.009
  9     11.935    11.936
 10     12.828    12.829







Comparing this to (2.4.18) and (2.4.28), we see that W(x) corresponds directly 
to Hamilton’s characteristic function. Indeed, condition (2.5.42) is the same as 
(2.4.26), the condition for reaching the classical limit. For these reasons, the WKB 
approximation is frequently referred to as a “semiclassical” approximation.

We also note that condition (2.5.42) is equivalent to $|k'(x)| \ll |k^2(x)|$. In terms
of the de Broglie wavelength divided by $2\pi$, this condition amounts to

$$\bar{\lambda} = \frac{\hbar}{\sqrt{2m[E-V(x)]}} \ll \frac{2[E-V(x)]}{|dV/dx|}. \tag{2.5.60}$$

In other words, $\bar{\lambda}$ must be small compared with the characteristic distance over
which the potential varies appreciably. Roughly speaking, the potential must be 
essentially constant over many wavelengths. Thus we see that the semiclassical 
picture is reliable in the short-wavelength limit. 



2.6 ■ PROPAGATORS AND FEYNMAN PATH INTEGRALS 



Propagators in Wave Mechanics 

In Section 2.1 we showed how the most general time-evolution problem with a 
time-independent Hamiltonian can be solved once we expand the initial ket in 
terms of the eigenkets of an observable that commutes with H. Let us translate 
this statement into the language of wave mechanics. We start with 



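$$|\alpha,t_0;t\rangle = \exp\left[\frac{-iH(t-t_0)}{\hbar}\right]|\alpha,t_0\rangle = \sum_{a'}|a'\rangle\langle a'|\alpha,t_0\rangle\exp\left[\frac{-iE_{a'}(t-t_0)}{\hbar}\right]. \tag{2.6.1}$$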



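Multiplying both sides by $\langle\mathbf{x}'|$ on the left, we have

$$\langle\mathbf{x}'|\alpha,t_0;t\rangle = \sum_{a'}\langle\mathbf{x}'|a'\rangle\langle a'|\alpha,t_0\rangle\exp\left[\frac{-iE_{a'}(t-t_0)}{\hbar}\right], \tag{2.6.2}$$

which is of the form

$$\psi(\mathbf{x}',t) = \sum_{a'}c_{a'}(t_0)\,u_{a'}(\mathbf{x}')\exp\left[\frac{-iE_{a'}(t-t_0)}{\hbar}\right], \tag{2.6.3}$$

with

$$u_{a'}(\mathbf{x}') = \langle\mathbf{x}'|a'\rangle \tag{2.6.4}$$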



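standing for the eigenfunction of operator $A$ with eigenvalue $a'$. Note also that

$$\langle a'|\alpha,t_0\rangle = \int d^3x'\,\langle a'|\mathbf{x}'\rangle\langle\mathbf{x}'|\alpha,t_0\rangle, \tag{2.6.5}$$

which we recognize as the usual rule in wave mechanics for getting the expansion
coefficients of the initial state:

$$c_{a'}(t_0) = \int d^3x'\,u_{a'}^{*}(\mathbf{x}')\,\psi(\mathbf{x}',t_0). \tag{2.6.6}$$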

All this should be straightforward and familiar. Now (2.6.2) together with
(2.6.5) can also be visualized as some kind of integral operator acting on the
initial wave function to yield the final wave function:

$$\psi(\mathbf{x}'',t) = \int d^3x'\,K(\mathbf{x}'',t;\mathbf{x}',t_0)\,\psi(\mathbf{x}',t_0). \tag{2.6.7}$$

Here the kernel of the integral operator, known as the propagator in wave me- 
chanics, is given by 



$$K(\mathbf{x}'',t;\mathbf{x}',t_0) = \sum_{a'}\langle\mathbf{x}''|a'\rangle\langle a'|\mathbf{x}'\rangle\exp\left[\frac{-iE_{a'}(t-t_0)}{\hbar}\right]. \tag{2.6.8}$$



In any given problem the propagator depends only on the potential and is inde- 
pendent of the initial wave function. It can be constructed once the energy eigen- 
functions and their eigenvalues are given. 

Clearly, the time evolution of the wave function is completely predicted if
$K(\mathbf{x}'',t;\mathbf{x}',t_0)$ is known and $\psi(\mathbf{x}',t_0)$ is given initially. In this sense Schrödinger’s
wave mechanics is a perfectly causal theory. The time development of a wave
function subjected to some potential is as “deterministic” as anything else in clas- 
sical mechanics provided that the system is left undisturbed. The only peculiar 
feature, if any, is that when a measurement intervenes, the wave function changes 
abruptly, in an uncontrollable way, into one of the eigenfunctions of the observ- 
able being measured. 

There are two properties of the propagator worth recording here. First, for
$t > t_0$, $K(\mathbf{x}'',t;\mathbf{x}',t_0)$ satisfies Schrödinger’s time-dependent wave equation in
the variables $\mathbf{x}''$ and $t$, with $\mathbf{x}'$ and $t_0$ fixed. This is evident from (2.6.8) because
$\langle\mathbf{x}''|a'\rangle\exp[-iE_{a'}(t-t_0)/\hbar]$, being the wave function corresponding to
$U(t,t_0)|a'\rangle$, satisfies the wave equation. Second,

$$\lim_{t\to t_0}K(\mathbf{x}'',t;\mathbf{x}',t_0) = \delta^3(\mathbf{x}''-\mathbf{x}'), \tag{2.6.9}$$

which is also obvious; as $t \to t_0$, because of the completeness of $\{|a'\rangle\}$, sum
(2.6.8) just reduces to $\langle\mathbf{x}''|\mathbf{x}'\rangle$.

Because of these two properties, the propagator (2.6.8), regarded as a function 
of x", is simply the wave function at t of a particle that was localized precisely 
at x' at some earlier time to. Indeed, this interpretation follows, perhaps more 
elegantly, from noting that (2.6.8) can also be written as 



$$K(\mathbf{x}'',t;\mathbf{x}',t_0) = \langle\mathbf{x}''|\exp\left[\frac{-iH(t-t_0)}{\hbar}\right]|\mathbf{x}'\rangle, \tag{2.6.10}$$







where the time-evolution operator acting on $|\mathbf{x}'\rangle$ is just the state ket at $t$ of a system
that was localized precisely at $\mathbf{x}'$ at time $t_0$ ($< t$). If we wish to solve a more
general problem where the initial wave function extends over a finite region of
space, all we have to do is multiply $\psi(\mathbf{x}',t_0)$ by the propagator $K(\mathbf{x}'',t;\mathbf{x}',t_0)$ and
integrate over all space (that is, over $\mathbf{x}'$). In this manner we can add the various
contributions from different positions ($\mathbf{x}'$). This situation is analogous to one in
electrostatics; if we wish to find the electrostatic potential due to a general charge
distribution $\rho(\mathbf{x}')$, we first solve the point-charge problem, multiply the
point-charge solution by the charge distribution, and integrate:

$$\phi(\mathbf{x}) = \int d^3x'\,\frac{\rho(\mathbf{x}')}{|\mathbf{x}-\mathbf{x}'|}. \tag{2.6.11}$$

The reader familiar with the theory of the Green’s functions must have recog- 
nized by this time that the propagator is simply the Green’s function for the time- 
dependent wave equation satisfying 



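$$\left[-\left(\frac{\hbar^2}{2m}\right)\nabla''^2 + V(\mathbf{x}'') - i\hbar\frac{\partial}{\partial t}\right]K(\mathbf{x}'',t;\mathbf{x}',t_0) = -i\hbar\,\delta^3(\mathbf{x}''-\mathbf{x}')\,\delta(t-t_0), \tag{2.6.12}$$

with the boundary condition

$$K(\mathbf{x}'',t;\mathbf{x}',t_0) = 0, \qquad \text{for } t < t_0. \tag{2.6.13}$$

The delta function $\delta(t-t_0)$ is needed on the right-hand side of (2.6.12) because $K$
varies discontinuously at $t = t_0$.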

The particular form of the propagator is, of course, dependent on the particular 
potential to which the particle is subjected. Consider, as an example, a free particle 
in one dimension. The obvious observable that commutes with H is momentum; 
$|p'\rangle$ is a simultaneous eigenket of the operators $p$ and $H$:

$$p|p'\rangle = p'|p'\rangle, \qquad H|p'\rangle = \left(\frac{p'^2}{2m}\right)|p'\rangle. \tag{2.6.14}$$



The momentum eigenfunction is just the transformation function of Section 1.7
[see (1.7.32)], which is of the plane-wave form. Combining everything, we have

$$K(x'',t;x',t_0) = \left(\frac{1}{2\pi\hbar}\right)\int_{-\infty}^{\infty}dp'\,\exp\left[\frac{ip'(x''-x')}{\hbar} - \frac{ip'^2(t-t_0)}{2m\hbar}\right]. \tag{2.6.15}$$

The integral can be evaluated by completing the square in the exponent. Here we
simply record the result:

$$K(x'',t;x',t_0) = \sqrt{\frac{m}{2\pi i\hbar(t-t_0)}}\,\exp\left[\frac{im(x''-x')^2}{2\hbar(t-t_0)}\right]. \tag{2.6.16}$$

This expression may be used, for example, to study how a Gaussian wave packet
spreads out as a function of time.
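The first property recorded earlier, that the propagator obeys the time-dependent wave equation in its final variables, can be checked numerically for the free-particle result (2.6.16). The sketch below is only an illustration: it assumes units with $\hbar = m = 1$, and the names `free_propagator` and `schrodinger_residual` are hypothetical, not from the text. Derivatives are estimated with central finite differences.

```python
import cmath
import math

HBAR = 1.0  # illustrative units (assumption): hbar = m = 1
MASS = 1.0


def free_propagator(x2: float, t: float, x1: float = 0.0, t0: float = 0.0) -> complex:
    """Free-particle propagator (2.6.16) in one dimension, valid for t > t0."""
    dt = t - t0
    prefactor = cmath.sqrt(MASS / (2j * math.pi * HBAR * dt))
    return prefactor * cmath.exp(1j * MASS * (x2 - x1) ** 2 / (2.0 * HBAR * dt))


def schrodinger_residual(x: float, t: float, h: float = 1e-3) -> float:
    """|i hbar dK/dt + (hbar^2 / 2m) d^2K/dx^2|, estimated by central differences.

    For a free particle the wave equation is i hbar dK/dt = -(hbar^2 / 2m) d^2K/dx^2,
    so this residual should vanish up to discretization error.
    """
    dk_dt = (free_propagator(x, t + h) - free_propagator(x, t - h)) / (2.0 * h)
    d2k_dx2 = (
        free_propagator(x + h, t) - 2.0 * free_propagator(x, t) + free_propagator(x - h, t)
    ) / h**2
    return abs(1j * HBAR * dk_dt + HBAR**2 / (2.0 * MASS) * d2k_dx2)


# The residual is tiny compared with |K|, which is of order 0.3 at this point.
print(schrodinger_residual(0.7, 1.3), abs(free_propagator(0.7, 1.3)))
```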







For the simple harmonic oscillator, where the wave function of an energy 
eigenstate is given by 



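$$u_n(x)\exp\left(\frac{-iE_nt}{\hbar}\right) = \left(\frac{1}{2^{n/2}\sqrt{n!}}\right)\left(\frac{m\omega}{\pi\hbar}\right)^{1/4}\exp\left(\frac{-m\omega x^2}{2\hbar}\right)H_n\!\left(\sqrt{\frac{m\omega}{\hbar}}\,x\right)\exp\left[-i\omega\left(n+\tfrac{1}{2}\right)t\right], \tag{2.6.17}$$

the propagator is given by

$$K(x'',t;x',t_0) = \sqrt{\frac{m\omega}{2\pi i\hbar\sin[\omega(t-t_0)]}}\,\exp\left[\left\{\frac{im\omega}{2\hbar\sin[\omega(t-t_0)]}\right\}\left\{(x''^2+x'^2)\cos[\omega(t-t_0)] - 2x''x'\right\}\right]. \tag{2.6.18}$$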



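One way to prove this is to use

$$\frac{1}{\sqrt{1-\zeta^2}}\exp\left[\frac{-(\xi^2+\eta^2-2\xi\eta\zeta)}{(1-\zeta^2)}\right] = \exp\left[-(\xi^2+\eta^2)\right]\sum_{n=0}^{\infty}\left(\frac{\zeta^n}{2^n n!}\right)H_n(\xi)H_n(\eta), \tag{2.6.19}$$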



which is found in books on special functions (Morse and Feshbach 1953, p. 786).
It can also be obtained using the $a$, $a^\dagger$ operator method (Saxon 1968, pp. 144-45)
or, alternatively, the path-integral method to be described later. Notice that
(2.6.18) is a periodic function of $t$ with angular frequency $\omega$, the classical
oscillator frequency. This means, among other things, that a particle initially localized
precisely at $x'$ will return to its original position with certainty at $2\pi/\omega$ ($4\pi/\omega$,
and so forth) later.

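Certain space and time integrals derivable from $K(\mathbf{x}'',t;\mathbf{x}',t_0)$ are of considerable
interest. Without loss of generality, we set $t_0 = 0$ in the following. The first
integral we consider is obtained by setting $\mathbf{x}'' = \mathbf{x}'$ and integrating over all space.
We have

$$G(t) \equiv \int d^3x'\,K(\mathbf{x}',t;\mathbf{x}',0) = \int d^3x'\sum_{a'}|\langle\mathbf{x}'|a'\rangle|^2\exp\left(\frac{-iE_{a'}t}{\hbar}\right) = \sum_{a'}\exp\left(\frac{-iE_{a'}t}{\hbar}\right). \tag{2.6.20}$$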



This result is anticipated; recalling (2.6.10), we observe that setting $\mathbf{x}' = \mathbf{x}''$ and
integrating are equivalent to taking the trace of the time-evolution operator in
the $\mathbf{x}$-representation. But the trace is independent of representations; it can be
evaluated more readily using the $\{|a'\rangle\}$ basis where the time-evolution operator
is diagonal, which immediately leads to the last line of (2.6.20). Now we see
that (2.6.20) is just the “sum over states,” reminiscent of the partition function
in statistical mechanics. In fact, if we analytically continue in the $t$-variable and
make $t$ purely imaginary, with $\beta$ defined by

$$\beta = \frac{it}{\hbar} \tag{2.6.21}$$

real and positive, we can identify (2.6.20) with the partition function itself:

$$Z = \sum_{a'}\exp(-\beta E_{a'}). \tag{2.6.22}$$



For this reason some of the techniques encountered in studying propagators in 
quantum mechanics are also useful in statistical mechanics. 
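The identification of the trace of the propagator with the partition function can be illustrated with the simple harmonic oscillator. The sketch below is an illustration under stated assumptions, not a calculation from the text: it takes (2.6.18) at $x'' = x' = x$, $t_0 = 0$, sets $t = -i\hbar\beta$ as in (2.6.21), works in units with $\hbar = m = \omega = 1$, and compares a trapezoidal estimate of the integral in (2.6.20) with the sum over states (2.6.22) for $E_n = n + \tfrac{1}{2}$. All function names are hypothetical.

```python
import math


def euclidean_diagonal_propagator(x: float, beta: float) -> float:
    """K(x, t; x, 0) from (2.6.18) at t = -i*hbar*beta, with hbar = m = omega = 1.

    With sin(omega t) -> -i sinh(beta) and cos(omega t) -> cosh(beta), the
    exponent becomes -x^2 (cosh(beta) - 1) / sinh(beta) = -x^2 tanh(beta / 2).
    """
    return math.sqrt(1.0 / (2.0 * math.pi * math.sinh(beta))) * math.exp(
        -x * x * math.tanh(beta / 2.0)
    )


def trace_of_propagator(beta: float, half_width: float = 12.0, steps: int = 6000) -> float:
    """Trapezoidal estimate of the integral of K(x, t; x, 0) over x."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        x = -half_width + k * dx
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * euclidean_diagonal_propagator(x, beta)
    return total * dx


def sum_over_states(beta: float, n_max: int = 400) -> float:
    """Partition function (2.6.22) with oscillator levels E_n = n + 1/2."""
    return sum(math.exp(-beta * (n + 0.5)) for n in range(n_max + 1))


beta = 2.0
print(trace_of_propagator(beta), sum_over_states(beta), 1.0 / (2.0 * math.sinh(beta / 2.0)))
```

All three printed numbers should agree, the last being the closed form of the geometric series.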

Next, let us consider the Laplace-Fourier transform of G(t): 



$$\tilde{G}(E) \equiv -i\int_{0}^{\infty}dt\,G(t)\exp(iEt/\hbar)/\hbar = -i\int_{0}^{\infty}dt\sum_{a'}\exp(-iE_{a'}t/\hbar)\exp(iEt/\hbar)/\hbar. \tag{2.6.23}$$



The integrand here oscillates indefinitely. But we can make the integral meaningful
by letting $E$ acquire a small positive imaginary part:

$$E \to E + i\varepsilon. \tag{2.6.24}$$

We then obtain, in the limit $\varepsilon \to 0$,

$$\tilde{G}(E) = \sum_{a'}\frac{1}{E - E_{a'}}. \tag{2.6.25}$$

Observe now that the complete energy spectrum is exhibited as simple poles of
$\tilde{G}(E)$ in the complex $E$-plane. If we wish to know the energy spectrum of a
physical system, it is sufficient to study the analytic properties of $\tilde{G}(E)$.
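The step from (2.6.23) to (2.6.25) can be spelled out for a single term of the sum. As a sketch, with $\varepsilon > 0$ so that the integrand decays at the upper limit:

$$-\frac{i}{\hbar}\int_{0}^{\infty}dt\,\exp\left[\frac{i(E+i\varepsilon-E_{a'})t}{\hbar}\right] = -\frac{i}{\hbar}\cdot\frac{i\hbar}{E+i\varepsilon-E_{a'}} = \frac{1}{E+i\varepsilon-E_{a'}} \;\longrightarrow\; \frac{1}{E-E_{a'}} \quad (\varepsilon \to 0).$$

Each energy eigenvalue therefore contributes one simple pole, displaced infinitesimally below the real axis before the limit is taken.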



Propagator as a Transition Amplitude 

To gain further insight into the physical meaning of the propagator, we wish to
relate it to the concept of transition amplitudes introduced in Section 2.2. But first,
recall that the wave function, which is the inner product of the fixed position bra
$\langle\mathbf{x}'|$ with the moving state ket $|\alpha,t_0;t\rangle$, can also be regarded as the inner product of
the Heisenberg-picture position bra $\langle\mathbf{x}',t|$, which moves “oppositely” with time,
with the Heisenberg-picture state ket $|\alpha,t_0\rangle$, which is fixed in time. Likewise, the




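propagator can also be written as

$$\begin{aligned}
K(\mathbf{x}'',t;\mathbf{x}',t_0) &= \sum_{a'}\langle\mathbf{x}''|a'\rangle\langle a'|\mathbf{x}'\rangle\exp\left[\frac{-iE_{a'}(t-t_0)}{\hbar}\right] \\
&= \sum_{a'}\langle\mathbf{x}''|\exp\left(\frac{-iHt}{\hbar}\right)|a'\rangle\langle a'|\exp\left(\frac{iHt_0}{\hbar}\right)|\mathbf{x}'\rangle \\
&= \langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle,
\end{aligned} \tag{2.6.26}$$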

where $|\mathbf{x}',t_0\rangle$ and $\langle\mathbf{x}'',t|$ are to be understood as an eigenket and an eigenbra of
the position operator in the Heisenberg picture. In Section 2.1 we showed that
$\langle b',t|a'\rangle$, in the Heisenberg-picture notation, is the probability amplitude for a
system originally prepared to be an eigenstate of $A$ with eigenvalue $a'$ at some
initial time $t_0 = 0$ to be found at a later time $t$ in an eigenstate of $B$ with eigenvalue
$b'$, and we called it the transition amplitude for going from state $|a'\rangle$ to state $|b'\rangle$.
Because there is nothing special about the choice of $t_0$ (only the time difference
$t - t_0$ is relevant), we can identify $\langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle$ as the probability amplitude for the
particle prepared at $t_0$ with position eigenvalue $\mathbf{x}'$ to be found at a later time $t$ at
$\mathbf{x}''$. Roughly speaking, $\langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle$ is the amplitude for the particle to go from a
space-time point $(\mathbf{x}',t_0)$ to another space-time point $(\mathbf{x}'',t)$, so the term transition
amplitude for this expression is quite appropriate. This interpretation is, of course,
in complete accord with the interpretation we gave earlier for $K(\mathbf{x}'',t;\mathbf{x}',t_0)$.

Yet another way to interpret $\langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle$ is as follows. As we emphasized
earlier, $|\mathbf{x}',t_0\rangle$ is the position eigenket at $t_0$ with the eigenvalue $\mathbf{x}'$ in the
Heisenberg picture. Because at any given time the Heisenberg-picture eigenkets of an
observable can be chosen as base kets, we can regard $\langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle$ as the
transformation function that connects the two sets of base kets at different times. So in the
Heisenberg picture, time evolution can be viewed as a unitary transformation, in
the sense of changing bases, that connects one set of base kets formed by $\{|\mathbf{x}',t_0\rangle\}$
to another formed by $\{|\mathbf{x}'',t\rangle\}$. This is reminiscent of classical physics, in which
the time development of a classical dynamic variable such as $\mathbf{x}(t)$ is viewed as
a canonical (or contact) transformation generated by the classical Hamiltonian
(Goldstein 2002, pp. 401-2).

It turns out to be convenient to use a notation that treats the space and time
coordinates more symmetrically. To this end we write $\langle\mathbf{x}'',t''|\mathbf{x}',t'\rangle$ in place of
$\langle\mathbf{x}'',t|\mathbf{x}',t_0\rangle$. Because at any given time the position kets in the Heisenberg picture
form a complete set, it is legitimate to insert the identity operator written as

$$\int d^3x''\,|\mathbf{x}'',t''\rangle\langle\mathbf{x}'',t''| = 1 \tag{2.6.27}$$

at any place we desire. For example, consider the time evolution from $t'$ to $t'''$; by
dividing the time interval $(t',t''')$ into two parts, $(t',t'')$ and $(t'',t''')$, we have

$$\langle\mathbf{x}''',t'''|\mathbf{x}',t'\rangle = \int d^3x''\,\langle\mathbf{x}''',t'''|\mathbf{x}'',t''\rangle\langle\mathbf{x}'',t''|\mathbf{x}',t'\rangle, \qquad (t''' > t'' > t'). \tag{2.6.28}$$







We call this the composition property of the transition amplitude.* Clearly, we
can divide the time interval into as many smaller subintervals as we wish. We
have

$$\langle\mathbf{x}'''',t''''|\mathbf{x}',t'\rangle = \int d^3x'''\int d^3x''\,\langle\mathbf{x}'''',t''''|\mathbf{x}''',t'''\rangle\langle\mathbf{x}''',t'''|\mathbf{x}'',t''\rangle\langle\mathbf{x}'',t''|\mathbf{x}',t'\rangle, \qquad (t'''' > t''' > t'' > t'), \tag{2.6.29}$$

and so on. If we somehow guess the form of $\langle\mathbf{x}'',t''|\mathbf{x}',t'\rangle$ for an infinitesimal time
interval (between $t'$ and $t'' = t' + dt$), we should be able to obtain the amplitude
$\langle\mathbf{x}'',t''|\mathbf{x}',t'\rangle$ for a finite time interval by compounding the appropriate transition
amplitudes for infinitesimal time intervals in a manner analogous to (2.6.29). This
kind of reasoning leads to an independent formulation of quantum mechanics that
R. P. Feynman published in 1948, to which we now turn our attention.



Path Integrals as the Sum Over Paths 

Without loss of generality we restrict ourselves to one-dimensional problems. 
Also, we avoid awkward expressions like

$$x^{\overbrace{\prime\prime\prime\cdots\prime}^{N\ \text{times}}}$$

by using notation such as $x_N$. With this notation we consider the transition
amplitude for a particle going from the initial space-time point $(x_1,t_1)$ to the final
space-time point $(x_N,t_N)$. The entire time interval between $t_1$ and $t_N$ is divided
into $N - 1$ equal parts:

$$t_j - t_{j-1} = \Delta t = \frac{(t_N - t_1)}{(N-1)}. \tag{2.6.30}$$

Exploiting the composition property, we obtain

$$\langle x_N,t_N|x_1,t_1\rangle = \int dx_{N-1}\int dx_{N-2}\cdots\int dx_2\,\langle x_N,t_N|x_{N-1},t_{N-1}\rangle\langle x_{N-1},t_{N-1}|x_{N-2},t_{N-2}\rangle\cdots\langle x_2,t_2|x_1,t_1\rangle. \tag{2.6.31}$$



To visualize this pictorially, we consider a space-time plane, as shown in
Figure 2.6. The initial and final space-time points are fixed to be $(x_1,t_1)$ and $(x_N,t_N)$,
respectively. For each time segment, say between $t_{n-1}$ and $t_n$, we are instructed to
consider the transition amplitude to go from $(x_{n-1},t_{n-1})$ to $(x_n,t_n)$; we then
integrate over $x_2, x_3, \ldots, x_{N-1}$. This means that we must sum over all possible paths
in the space-time plane with the end points fixed.

*The analogue of (2.6.28) in probability theory is known as the Chapman-Kolmogoroff equation,
and in diffusion theory as the Smoluchowsky equation.

Before proceeding further, it is profitable to review here how paths appear in
classical mechanics. Suppose we have a particle subjected to a force field derivable
from a potential $V(x)$. The classical Lagrangian is written as

$$L_{\text{classical}}(x,\dot{x}) = \frac{m\dot{x}^2}{2} - V(x). \tag{2.6.32}$$

Given this Lagrangian with the end points $(x_1,t_1)$ and $(x_N,t_N)$ specified, we do
not consider just any path joining $(x_1,t_1)$ and $(x_N,t_N)$ in classical mechanics. On
the contrary, there exists a unique path that corresponds to the actual motion of
the classical particle. For example, given

$$V(x) = mgx, \qquad (x_1,t_1) = (h,0), \qquad (x_N,t_N) = \left(0,\sqrt{\frac{2h}{g}}\right), \tag{2.6.33}$$

where $h$ may stand for the height of the Leaning Tower of Pisa, the classical path
in the $xt$-plane can only be

$$x = h - \frac{gt^2}{2}. \tag{2.6.34}$$

More generally, according to Hamilton’s principle, the unique path is that which 
minimizes the action, defined as the time integral of the classical Lagrangian: 



$$\delta\int_{t_1}^{t_2}dt\,L_{\text{classical}}(x,\dot{x}) = 0, \tag{2.6.35}$$

from which Lagrange’s equation of motion can be obtained. 
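Hamilton's principle can be illustrated numerically with the falling-body example (2.6.33) and (2.6.34). The sketch below is an illustration, not part of the text: the numerical values of the mass, $g$, and $h$, the sinusoidal deformation, and the function names are all assumptions. The action is approximated by replacing the path with straight-line segments and evaluating the potential at each segment midpoint.

```python
import math

MASS, G, HEIGHT = 1.0, 9.8, 20.0       # illustrative values (assumptions)
T_FALL = math.sqrt(2.0 * HEIGHT / G)   # arrival time t_N from (2.6.33)


def path(t: float, eps: float) -> float:
    """Classical path (2.6.34) plus a deformation that vanishes at both end points."""
    return HEIGHT - 0.5 * G * t * t + eps * math.sin(math.pi * t / T_FALL)


def action(eps: float, segments: int = 2000) -> float:
    """Discretized time integral of L = m xdot^2 / 2 - m g x along the deformed path."""
    dt = T_FALL / segments
    total = 0.0
    for n in range(1, segments + 1):
        x_prev, x_next = path((n - 1) * dt, eps), path(n * dt, eps)
        kinetic = 0.5 * MASS * ((x_next - x_prev) / dt) ** 2
        potential = MASS * G * 0.5 * (x_next + x_prev)  # V at the segment midpoint
        total += dt * (kinetic - potential)
    return total


# The undeformed path (eps = 0) gives the smallest action in this family.
for eps in (-0.5, -0.1, 0.0, 0.1, 0.5):
    print(f"eps = {eps:+.1f}   S = {action(eps):.6f}")
```

For this particular deformation the action changes only at second order in `eps`, by approximately $\varepsilon^2 m\pi^2/(4t_N)$, so the first-order variation vanishes as (2.6.35) requires.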



Feynman's Formulation 

The basic difference between classical mechanics and quantum mechanics should
now be apparent. In classical mechanics a definite path in the $xt$-plane is
associated with the particle’s motion; in contrast, in quantum mechanics all possible
paths must play roles, including those that do not bear any resemblance to the
classical path. Yet we must somehow be able to reproduce classical mechanics in
a smooth manner in the limit $\hbar \to 0$. How are we to accomplish this?

As a young graduate student at Princeton University, R. P. Feynman tried to 
attack this problem. In looking for a possible clue, he was said to be intrigued by a 
mysterious remark in Dirac’s book that, in our notation, amounts to the following 
statement: 



$$\exp\left[i\int_{t_1}^{t_2}\frac{dt\,L_{\text{classical}}(x,\dot{x})}{\hbar}\right] \quad\text{corresponds to}\quad \langle x_2,t_2|x_1,t_1\rangle.$$



Feynman attempted to make sense out of this remark. Is “corresponds to” the same 
thing as “is equal to” or “is proportional to”? In so doing he was led to formulate 
a space-time approach to quantum mechanics based on path integrals. 

In Feynman’s formulation the classical action plays a very important role. For
compactness, we introduce a new notation:

$$S(n,n-1) \equiv \int_{t_{n-1}}^{t_n}dt\,L_{\text{classical}}(x,\dot{x}). \tag{2.6.36}$$

Because $L_{\text{classical}}$ is a function of $x$ and $\dot{x}$, $S(n,n-1)$ is defined only after a definite
path is specified along which the integration is to be carried out. So even though
the path dependence is not explicit in this notation, it is understood that we are
considering a particular path in evaluating the integral. Imagine now that we are
following some prescribed path. We concentrate our attention on a small segment
along that path, say between $(x_{n-1},t_{n-1})$ and $(x_n,t_n)$. According to Dirac, we are
instructed to associate $\exp[iS(n,n-1)/\hbar]$ with that segment. Going along the
definite path we are set to follow, we successively multiply expressions of this
type to obtain



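$$\prod_{n=2}^{N}\exp\left[\frac{iS(n,n-1)}{\hbar}\right] = \exp\left[\left(\frac{i}{\hbar}\right)\sum_{n=2}^{N}S(n,n-1)\right] = \exp\left[\frac{iS(N,1)}{\hbar}\right]. \tag{2.6.37}$$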



This does not yet give $\langle x_N,t_N|x_1,t_1\rangle$; rather, this equation is the contribution to
$\langle x_N,t_N|x_1,t_1\rangle$ arising from the particular path we have considered. We must still
integrate over $x_2, x_3, \ldots, x_{N-1}$. At the same time, exploiting the composition
property, we let the time interval between $t_{n-1}$ and $t_n$ be infinitesimally small. Thus
our candidate expression for $\langle x_N,t_N|x_1,t_1\rangle$ may be written, in some loose sense,
as

$$\langle x_N,t_N|x_1,t_1\rangle \sim \sum_{\text{all paths}}\exp\left[\frac{iS(N,1)}{\hbar}\right], \tag{2.6.38}$$

where the sum is to be taken over an innumerably infinite set of paths!

Before presenting a more precise formulation, let us see whether considerations
along this line make sense in the classical limit. As $\hbar \to 0$, the exponential
in (2.6.38) oscillates very violently, so there is a tendency for cancellation among
various contributions from neighboring paths. This is because $\exp[iS/\hbar]$ for some
definite path and $\exp[iS/\hbar]$ for a slightly different path have very different phases
as a consequence of the smallness of $\hbar$. So most paths do not contribute when $\hbar$
is regarded as a small quantity. However, there is an important exception.

FIGURE 2.7 Paths important in the $\hbar \to 0$ limit.

Suppose that we consider a path that satisfies

$$\delta S(N,1) = 0, \tag{2.6.39}$$

where the change in $S$ is due to a slight deformation of the path with the end
points fixed. This is precisely the classical path by virtue of Hamilton’s principle.
We denote the $S$ that satisfies (2.6.39) by $S_{\min}$. We now attempt to deform the
path a little bit from the classical path. The resulting $S$ is still equal to $S_{\min}$ to
first order in deformation. This means that the phase of $\exp[iS/\hbar]$ does not vary
very much as we deviate slightly from the classical path even if $\hbar$ is small. As a
result, as long as we stay near the classical path, constructive interference between
neighboring paths is possible. In the $\hbar \to 0$ limit, the major contributions must
then arise from a very narrow strip (or a tube in higher dimensions) containing the
classical path, as shown in Figure 2.7. Our (or Feynman’s) guess based on Dirac’s
mysterious remark makes good sense because the classical path gets singled out
in the $\hbar \to 0$ limit. To formulate Feynman’s conjecture more precisely, let us go
back to $\langle x_n,t_n|x_{n-1},t_{n-1}\rangle$, where the time difference $t_n - t_{n-1}$ is assumed to be
infinitesimally small. We write



$$\langle x_n,t_n|x_{n-1},t_{n-1}\rangle = \left[\frac{1}{w(\Delta t)}\right]\exp\left[\frac{iS(n,n-1)}{\hbar}\right], \tag{2.6.40}$$



where we evaluate $S(n,n-1)$ in a moment in the $\Delta t \to 0$ limit. Notice that we
have inserted a weight factor, $1/w(\Delta t)$, which is assumed to depend only on the
time interval $t_n - t_{n-1}$ and not on $V(x)$. That such a factor is needed is clear
from dimensional considerations; according to the way we normalized our position
eigenkets, $\langle x_n,t_n|x_{n-1},t_{n-1}\rangle$ must have the dimension of 1/length.







We now look at the exponential in (2.6.40). Our task is to evaluate the $\Delta t \to 0$
limit of $S(n,n-1)$. Because the time interval is so small, it is legitimate to make a
straight-line approximation to the path joining $(x_{n-1},t_{n-1})$ and $(x_n,t_n)$ as follows:

$$S(n,n-1) = \int_{t_{n-1}}^{t_n}dt\left[\frac{m\dot{x}^2}{2} - V(x)\right] = \Delta t\left\{\left(\frac{m}{2}\right)\left[\frac{(x_n - x_{n-1})}{\Delta t}\right]^2 - V\!\left(\frac{(x_n + x_{n-1})}{2}\right)\right\}. \tag{2.6.41}$$



As an example, we consider specifically the free-particle case, V = 0. Equation 
(2.6.40) now becomes 



$$\langle x_n,t_n|x_{n-1},t_{n-1}\rangle = \left[\frac{1}{w(\Delta t)}\right]\exp\left[\frac{im(x_n - x_{n-1})^2}{2\hbar\Delta t}\right]. \tag{2.6.42}$$



We see that the exponent appearing here is identical to the one in the expression 
for the free-particle propagator (2.6.16). The reader may work out a similar com- 
parison for the simple harmonic oscillator. 

We remarked earlier that the weight factor $1/w(\Delta t)$ appearing in (2.6.40) is
assumed to be independent of $V(x)$, so we may as well evaluate it for the free
particle. Noting the orthonormality, in the sense of the $\delta$-function, of
Heisenberg-picture position eigenkets at equal times,

$$\langle x_n,t_n|x_{n-1},t_{n-1}\rangle\big|_{t_n = t_{n-1}} = \delta(x_n - x_{n-1}), \tag{2.6.43}$$



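we obtain

$$\frac{1}{w(\Delta t)} = \sqrt{\frac{m}{2\pi i\hbar\Delta t}}, \tag{2.6.44}$$

where we have used

$$\int_{-\infty}^{\infty}d\xi\,\exp\left(\frac{im\xi^2}{2\hbar\Delta t}\right) = \sqrt{\frac{2\pi i\hbar\Delta t}{m}} \tag{2.6.45a}$$

and

$$\lim_{\Delta t\to 0}\sqrt{\frac{m}{2\pi i\hbar\Delta t}}\,\exp\left(\frac{im\xi^2}{2\hbar\Delta t}\right) = \delta(\xi). \tag{2.6.45b}$$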



This weight factor is, of course, anticipated from the expression for the
free-particle propagator (2.6.16).

To summarize, as $\Delta t \to 0$, we are led to

$$\langle x_n,t_n|x_{n-1},t_{n-1}\rangle = \sqrt{\frac{m}{2\pi i\hbar\Delta t}}\,\exp\left[\frac{iS(n,n-1)}{\hbar}\right]. \tag{2.6.46}$$







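The final expression for the transition amplitude with $t_N - t_1$ finite is

$$\langle x_N,t_N|x_1,t_1\rangle = \lim_{N\to\infty}\left(\frac{m}{2\pi i\hbar\Delta t}\right)^{(N-1)/2}\int dx_{N-1}\int dx_{N-2}\cdots\int dx_2\prod_{n=2}^{N}\exp\left[\frac{iS(n,n-1)}{\hbar}\right], \tag{2.6.47}$$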



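where the $N \to \infty$ limit is taken with $x_N$ and $t_N$ fixed. It is customary here to
define a new kind of multidimensional (in fact, infinite-dimensional) integral
operator

$$\int_{x_1}^{x_N}\mathcal{D}[x(t)] \equiv \lim_{N\to\infty}\left(\frac{m}{2\pi i\hbar\Delta t}\right)^{(N-1)/2}\int dx_{N-1}\int dx_{N-2}\cdots\int dx_2 \tag{2.6.48}$$

and write (2.6.47) as

$$\langle x_N,t_N|x_1,t_1\rangle = \int_{x_1}^{x_N}\mathcal{D}[x(t)]\,\exp\left[i\int_{t_1}^{t_N}dt\,\frac{L_{\text{classical}}(x,\dot{x})}{\hbar}\right]. \tag{2.6.49}$$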



This expression is known as Feynman’s path integral. Its meaning as the sum 
over all possible paths should be apparent from (2.6.47). 

Our steps leading to (2.6.49) are not meant to be a derivation. Rather, we
(following Feynman) have attempted a new formulation of quantum mechanics based
on the concept of paths, motivated by Dirac’s mysterious remark. The only ideas
we borrowed from the conventional form of quantum mechanics are (1) the
superposition principle (used in summing the contributions from various alternative
paths), (2) the composition property of the transition amplitude, and (3) classical
correspondence in the $\hbar \to 0$ limit.

Even though we obtained the same result as the conventional theory for the
free-particle case, it is not obvious, from what we have done so far, that
Feynman’s formulation is completely equivalent to Schrödinger’s wave mechanics.
We conclude this section by proving that Feynman’s expression for $\langle x_N,t_N|x_1,t_1\rangle$
indeed satisfies Schrödinger’s time-dependent wave equation in the variables
$x_N, t_N$, just as the propagator defined by (2.6.8) does.

We start with 



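$$\begin{aligned}
\langle x_N,t_N|x_1,t_1\rangle &= \int dx_{N-1}\,\langle x_N,t_N|x_{N-1},t_{N-1}\rangle\langle x_{N-1},t_{N-1}|x_1,t_1\rangle \\
&= \int_{-\infty}^{\infty}dx_{N-1}\sqrt{\frac{m}{2\pi i\hbar\Delta t}}\,\exp\left[\left(\frac{im}{2\hbar}\right)\frac{(x_N - x_{N-1})^2}{\Delta t} - \frac{iV\Delta t}{\hbar}\right]\langle x_{N-1},t_{N-1}|x_1,t_1\rangle,
\end{aligned} \tag{2.6.50}$$

where we have assumed $t_N - t_{N-1}$ to be infinitesimal. Introducing

$$\xi = x_N - x_{N-1} \tag{2.6.51}$$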




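and letting $x_N \to x$ and $t_N \to t + \Delta t$, we obtain

$$\langle x,t+\Delta t|x_1,t_1\rangle = \sqrt{\frac{m}{2\pi i\hbar\Delta t}}\int_{-\infty}^{\infty}d\xi\,\exp\left(\frac{im\xi^2}{2\hbar\Delta t} - \frac{iV\Delta t}{\hbar}\right)\langle x-\xi,t|x_1,t_1\rangle. \tag{2.6.52}$$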



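As is evident from (2.6.45b), in the limit $\Delta t \to 0$, the major contribution to this
integral comes from the $\xi \simeq 0$ region. It is therefore legitimate to expand
$\langle x-\xi,t|x_1,t_1\rangle$ in powers of $\xi$. We also expand $\langle x,t+\Delta t|x_1,t_1\rangle$ and $\exp(-iV\Delta t/\hbar)$ in
powers of $\Delta t$, so

$$\langle x,t|x_1,t_1\rangle + \Delta t\frac{\partial}{\partial t}\langle x,t|x_1,t_1\rangle = \sqrt{\frac{m}{2\pi i\hbar\Delta t}}\int_{-\infty}^{\infty}d\xi\,\exp\left(\frac{im\xi^2}{2\hbar\Delta t}\right)\left(1 - \frac{iV\Delta t}{\hbar} + \cdots\right)\left[\langle x,t|x_1,t_1\rangle + \left(\frac{\xi^2}{2}\right)\frac{\partial^2}{\partial x^2}\langle x,t|x_1,t_1\rangle + \cdots\right], \tag{2.6.53}$$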



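where we have dropped a term linear in $\xi$ because it vanishes when integrated
with respect to $\xi$. The $\langle x,t|x_1,t_1\rangle$ term on the left-hand side just matches the
leading term on the right-hand side because of (2.6.45a). Collecting terms that are
first-order in $\Delta t$, we obtain

$$\Delta t\frac{\partial}{\partial t}\langle x,t|x_1,t_1\rangle = \left(\sqrt{\frac{m}{2\pi i\hbar\Delta t}}\right)\left(\sqrt{2\pi}\right)\left(\frac{i\hbar\Delta t}{m}\right)^{3/2}\frac{1}{2}\frac{\partial^2}{\partial x^2}\langle x,t|x_1,t_1\rangle - \left(\frac{i}{\hbar}\right)\Delta t\,V\langle x,t|x_1,t_1\rangle, \tag{2.6.54}$$

where we have used

$$\int_{-\infty}^{\infty}d\xi\,\xi^2\exp\left(\frac{im\xi^2}{2\hbar\Delta t}\right) = \sqrt{2\pi}\left(\frac{i\hbar\Delta t}{m}\right)^{3/2}, \tag{2.6.55}$$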



obtained by differentiating (2.6.45a) with respect to $\Delta t$. In this manner we see
that $\langle x,t|x_1,t_1\rangle$ satisfies Schrödinger’s time-dependent wave equation:

$$i\hbar\frac{\partial}{\partial t}\langle x,t|x_1,t_1\rangle = -\left(\frac{\hbar^2}{2m}\right)\frac{\partial^2}{\partial x^2}\langle x,t|x_1,t_1\rangle + V\langle x,t|x_1,t_1\rangle. \tag{2.6.56}$$

Thus we can conclude that $\langle x,t|x_1,t_1\rangle$ constructed according to Feynman’s
prescription is the same as the propagator in Schrödinger’s wave mechanics.
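The arithmetic connecting (2.6.54) to (2.6.56) is short enough to write out. As a sketch, the three factors multiplying the second derivative in (2.6.54) combine to

$$\sqrt{\frac{m}{2\pi i\hbar\Delta t}}\cdot\sqrt{2\pi}\left(\frac{i\hbar\Delta t}{m}\right)^{3/2}\cdot\frac{1}{2} = \left(\frac{i\hbar\Delta t}{m}\right)^{-1/2}\left(\frac{i\hbar\Delta t}{m}\right)^{3/2}\cdot\frac{1}{2} = \frac{i\hbar\Delta t}{2m},$$

so that (2.6.54) reads

$$\Delta t\frac{\partial}{\partial t}\langle x,t|x_1,t_1\rangle = \frac{i\hbar\Delta t}{2m}\frac{\partial^2}{\partial x^2}\langle x,t|x_1,t_1\rangle - \frac{i}{\hbar}\Delta t\,V\langle x,t|x_1,t_1\rangle.$$

Multiplying both sides by $i\hbar/\Delta t$ then gives (2.6.56).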

Feynman’s space-time approach based on path integrals is not too convenient 
for attacking practical problems in nonrelativistic quantum mechanics. Even for 
the simple harmonic oscillator, it is rather cumbersome to evaluate explicitly the 







relevant path integral.* However, his approach is extremely gratifying from a con- 
ceptual point of view. By imposing a certain set of sensible requirements on a 
physical theory, we are inevitably led to a formalism equivalent to the usual for- 
mulation of quantum mechanics. It makes us wonder whether it is at all possible 
to construct a sensible alternative theory that is equally successful in accounting 
for microscopic phenomena. 

Methods based on path integrals have been found to be very powerful in other
branches of modern physics, such as quantum field theory and statistical mechanics.
In this book the path-integral method will appear again when we discuss the
Aharonov-Bohm effect.†



2.7 ■ POTENTIALS AND GAUGE TRANSFORMATIONS 
Constant Potentials 

In classical mechanics it is well known that the zero point of the potential energy 
is of no physical significance. The time development of dynamic variables such as 
x(t) and L(t) is independent of whether we use V (x) or V (x) + Vo with Vo constant 
both in space and time. The force that appears in Newton’s second law depends 
only on the gradient of the potential; an additive constant is clearly irrelevant. 
What is the analogous situation in quantum mechanics? 

We look at the time evolution of a Schrödinger-picture state ket subject to some
potential. Let $|\alpha,t_0;t\rangle$ be a state ket in the presence of $V(\mathbf{x})$, and let $|\widetilde{\alpha,t_0;t}\rangle$ be
the corresponding state ket appropriate for

$$\tilde{V}(\mathbf{x}) = V(\mathbf{x}) + V_0. \tag{2.7.1}$$

To be precise, let’s agree that the initial conditions are such that both kets coincide
with $|\alpha\rangle$ at $t = t_0$. If they represent the same physical situation, this can always
be done by a suitable choice of the phase. Recalling that the state ket at $t$ can be
obtained by applying the time-evolution operator $U(t,t_0)$ to the state ket at $t_0$, we
obtain

$$|\widetilde{\alpha,t_0;t}\rangle = \exp\left[-i\left(\frac{\mathbf{p}^2}{2m} + V(\mathbf{x}) + V_0\right)\frac{(t-t_0)}{\hbar}\right]|\alpha\rangle = \exp\left[\frac{-iV_0(t-t_0)}{\hbar}\right]|\alpha,t_0;t\rangle. \tag{2.7.2}$$



In other words, the ket computed under the influence of $\tilde{V}$ has a time dependence
different only by a phase factor $\exp[-iV_0(t-t_0)/\hbar]$. For stationary states, this
means that if the time dependence computed with $V(\mathbf{x})$ is $\exp[-iE(t-t_0)/\hbar]$,

*The reader is challenged to solve the simple harmonic oscillator problem using the Feynman 
path-integral method in Problem 2.34 of this chapter. 

†The reader who is interested in the fundamentals and applications of path integrals may consult
Feynman and Hibbs (1965) and also Zee (2010).







then the corresponding time dependence computed with $V(\mathbf{x}) + V_0$ is
$\exp[-i(E+V_0)(t-t_0)/\hbar]$. In other words, the use of $\tilde{V}$ in place of $V$ just amounts to the
following change:

$$E \to E + V_0, \tag{2.7.3}$$

which the reader probably guessed immediately. Observable effects such as the
time evolution of expectation values of $\langle\mathbf{x}\rangle$ and $\langle\mathbf{S}\rangle$ always depend on energy
differences [see (2.1.47)]; the Bohr frequencies that characterize the sinusoidal time
dependence of expectation values are the same whether we use $V(\mathbf{x})$ or $V(\mathbf{x}) + V_0$.
In general, there can be no difference in the expectation values of observables if
every state ket in the world is multiplied by a common factor $\exp[-iV_0(t-t_0)/\hbar]$.
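The statement that only energy differences are observable can be illustrated with a two-level toy model. The sketch below is an illustration, not from the text: the energies, amplitudes, the matrix element `X12`, and the function name are all assumptions, and the diagonal matrix elements of the observable are taken to vanish for simplicity.

```python
import cmath
import math

HBAR = 1.0          # illustrative units (assumption)
E1, E2 = 1.0, 2.5   # two energy eigenvalues (assumption)
C1, C2 = 0.6, 0.8   # real expansion coefficients with C1^2 + C2^2 = 1 (assumption)
X12 = 0.3           # off-diagonal matrix element <1|x|2>, taken real (assumption)


def expectation_x(t: float, v0: float = 0.0) -> float:
    """<x>(t) for the superposition C1|1> + C2|2> when every energy is shifted by v0."""
    a1 = C1 * cmath.exp(-1j * (E1 + v0) * t / HBAR)
    a2 = C2 * cmath.exp(-1j * (E2 + v0) * t / HBAR)
    # With vanishing diagonal elements, <x> = 2 Re(a1* a2 <1|x|2>).
    return 2.0 * (a1.conjugate() * a2 * X12).real


t = 0.7
print(expectation_x(t, v0=0.0), expectation_x(t, v0=5.0))
print(2.0 * C1 * C2 * X12 * math.cos((E2 - E1) * t / HBAR))
```

The common phase cancels between the bra and the ket, so both calls print the same number, and that number oscillates at the Bohr frequency $(E_2 - E_1)/\hbar$.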

Trivial as it may seem, we see here the first example of a class of transfor- 
mations known as gauge transformations. The change in our convention for the 
zero-point energy of the potential 

$$V(\mathbf{x}) \to V(\mathbf{x}) + V_0 \tag{2.7.4}$$



must be accompanied by a change in the state ket

$$|\alpha,t_0;t\rangle \to \exp\left[\frac{-iV_0(t-t_0)}{\hbar}\right]|\alpha,t_0;t\rangle. \tag{2.7.5}$$



Of course, this change implies the following change in the wave function:

$$\psi(\mathbf{x}',t) \to \exp\left[\frac{-iV_0(t-t_0)}{\hbar}\right]\psi(\mathbf{x}',t). \tag{2.7.6}$$



Next we consider $V_0$ that is spatially uniform but dependent on time. We then
easily see that the analogue of (2.7.5) is

$$|\alpha,t_0;t\rangle \to \exp\left[-i\int_{t_0}^{t} dt'\,\frac{V_0(t')}{\hbar}\right]|\alpha,t_0;t\rangle. \tag{2.7.7}$$



Physically, the use of V(x) + Vo(t) in place of V(x) simply means that we are 
choosing a new zero point of the energy scale at each instant of time. 

Even though the choice of the absolute scale of the potential is arbitrary, potential
differences are of nontrivial physical significance and, in fact, can be detected
in a very striking way. To illustrate this point, let us consider the arrangement
shown in Figure 2.8. A beam of charged particles is split into two parts, each of
which enters a metallic cage. If we so desire, we can maintain a finite potential
difference between the two cages by turning on a switch, as shown. A particle in the
beam can be visualized as a wave packet whose dimension is much smaller than
the dimension of the cage. Suppose we switch on the potential difference only
after the wave packets enter the cages and switch it off before the wave packets
leave the cages. The particle in the cage experiences no force because inside the
cage the potential is spatially uniform; hence no electric field is present. Now let
us recombine the two beam components in such a way that they meet in the
interference region of Figure 2.8. Because of the existence of the potential, each beam







FIGURE 2.8 Quantum-mechanical interference to detect a potential difference. 



component suffers a phase change, as indicated by (2.7.7). As a result, there is
an observable interference term in the beam intensity in the interference region,
namely,

$$\cos(\phi_1 - \phi_2), \quad \sin(\phi_1 - \phi_2), \tag{2.7.8}$$

where

$$\phi_1 - \phi_2 = \frac{1}{\hbar}\int_{t_i}^{t_f} dt\,[V_2(t) - V_1(t)]. \tag{2.7.9}$$

So despite the fact that the particle experiences no force, there is an observable
effect that depends on whether $V_2(t) - V_1(t)$ has been applied. Notice that this
effect is purely quantum-mechanical; in the limit $\hbar \to 0$, the interesting interference
effect gets washed out because the oscillation of the cosine becomes infinitely
rapid.*
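As a rough numerical illustration of (2.7.9), the sketch below integrates a potential-energy difference over the time the wave packets spend in the cages and converts the result into a relative intensity. The function names, the trapezoidal integration, the equal-amplitude assumption for the two beams, and the sample values (a 1 μV voltage difference acting on one elementary charge for 1 ns) are all assumptions made here for illustration.

```python
import math

HBAR = 1.054571817e-34        # J s
E_CHARGE = 1.602176634e-19    # C


def phase_difference(v2_minus_v1, t_i, t_f, steps=10_000):
    """(1/hbar) * integral of [V2(t) - V1(t)] dt, by the trapezoidal rule.

    v2_minus_v1 is a callable returning the potential-energy difference in joules.
    """
    dt = (t_f - t_i) / steps
    total = 0.5 * (v2_minus_v1(t_i) + v2_minus_v1(t_f))
    for k in range(1, steps):
        total += v2_minus_v1(t_i + k * dt)
    return total * dt / HBAR


def relative_intensity(delta_phi):
    """Two equal-amplitude beams: |1 + exp(i * delta_phi)|^2 / 4."""
    return 0.5 * (1.0 + math.cos(delta_phi))


# Hypothetical example: 1 microvolt between the cages, switched on for 1 ns.
energy_difference = E_CHARGE * 1.0e-6   # joules
delta_phi = phase_difference(lambda t: energy_difference, 0.0, 1.0e-9)
print(delta_phi, relative_intensity(delta_phi))
```

Because $\hbar$ sits in the denominator, even this tiny energy difference produces a phase of order one radian, which is why the cosine oscillates infinitely rapidly in the formal limit $\hbar \to 0$.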

Gravity in Quantum Mechanics 

There is an experiment that exhibits in a striking manner how a gravitational effect
appears in quantum mechanics. Before describing it, we first comment on the
role of gravity in both classical and quantum mechanics. Consider the classical
equation of motion for a purely falling body:

$$m\ddot{\mathbf{x}} = -m\boldsymbol{\nabla}\Phi_{\rm grav} = -mg\hat{\mathbf{z}}. \tag{2.7.10}$$

The mass term drops out, so in the absence of air resistance, a feather and a stone
would behave in the same way (à la Galileo) under the influence of gravity.
This is, of course, a direct consequence of the equality of the gravitational and
the inertial masses. Because the mass does not appear in the equation of a particle
trajectory, gravity in classical mechanics is often said to be a purely geometric
theory.

*This gedanken experiment is the Minkowski-rotated form of the Aharonov-Bohm experiment to 
be discussed later in this section. 








The situation is rather different in quantum mechanics. In the wave-mechanical
formulation, the analogue of (2.7.10) is

$$\left[-\left(\frac{\hbar^2}{2m}\right)\nabla^2 + m\Phi_{\rm grav}\right]\psi = i\hbar\frac{\partial\psi}{\partial t}. \tag{2.7.11}$$



The mass no longer cancels; instead it appears in the combination $\hbar/m$, so in a
problem where $\hbar$ appears, $m$ is also expected to appear. We can see this point also
using the Feynman path-integral formulation of a falling body based on

$$\langle \mathbf{x}_n,t_n|\mathbf{x}_{n-1},t_{n-1}\rangle = \sqrt{\frac{m}{2\pi i\hbar\,\Delta t}}\,\exp\left[i\int_{t_{n-1}}^{t_n} dt\,\frac{\left(\tfrac{1}{2}m\dot{\mathbf{x}}^2 - mgz\right)}{\hbar}\right], \qquad (t_n - t_{n-1} = \Delta t \to 0). \tag{2.7.12}$$



Here again we see that $m$ appears in the combination $m/\hbar$. This is in sharp contrast
with Hamilton’s classical approach based on

$$\delta\int_{t_1}^{t_2} dt\left(\frac{m\dot{\mathbf{x}}^2}{2} - mgz\right) = 0, \tag{2.7.13}$$



where m can be eliminated in the very beginning. 

Starting with the Schrödinger equation (2.7.11), we may derive the Ehrenfest
theorem

$$\frac{d^2}{dt^2}\langle\mathbf{x}\rangle = -g\hat{\mathbf{z}}. \tag{2.7.14}$$

However, $\hbar$ does not appear here, nor does $m$. To see a nontrivial quantum-
mechanical effect of gravity, we must study effects in which $\hbar$ appears explicitly
(and consequently where we expect the mass to appear), in contrast with purely
gravitational phenomena in classical mechanics.

Until 1975, there had been no direct experiment that established the presence
of the $m\Phi_{\rm grav}$ term in (2.7.11). To be sure, a free fall of an elementary particle
had been observed, but the classical equation of motion, or the Ehrenfest theorem
(2.7.14), where $\hbar$ does not appear, sufficed to account for this. The famous
“weight of photon” experiment of V. Pound and collaborators did not test gravity
in the quantum domain either, because they measured a frequency shift where $\hbar$
does not explicitly appear.

On the microscopic scale, gravitational forces are too weak to be readily
observable. To appreciate the difficulty involved in seeing gravity in bound-state
problems, let us consider the ground state of an electron and a neutron bound
by gravitational forces. This is the gravitational analogue of the hydrogen atom,
where an electron and a proton are bound by Coulomb forces. At the same
distance, the gravitational force between the electron and the neutron is weaker than
the Coulomb force between the electron and the proton by a factor of $\sim 2 \times 10^{39}$.








FIGURE 2.9 Experiment to detect gravity-induced quantum interference. 



The Bohr radius involved here can be obtained simply:

$$a_0 = \frac{\hbar^2}{e^2 m_e} \to \frac{\hbar^2}{G_N m_e^2 m_n}, \tag{2.7.15}$$

where $G_N$ is Newton’s gravitational constant. If we substitute numbers in the
equation, the Bohr radius of this gravitationally bound system turns out to be
$\sim 10^{31}$ cm, or $\sim 10^{13}$ light years, which is larger than the estimated radius of the
universe by a few orders of magnitude!
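The orders of magnitude quoted above are easy to reproduce. The following sketch assumes Gaussian (CGS) units and rounded reference values for the constants; the variable names are chosen here for illustration.

```python
# Rounded constants in Gaussian (CGS) units.
HBAR = 1.0546e-27       # erg s
E_CHARGE = 4.8032e-10   # esu
G_N = 6.674e-8          # cm^3 g^-1 s^-2
M_E = 9.1094e-28        # g, electron mass
M_N = 1.6749e-24        # g, neutron mass
LIGHT_YEAR = 9.461e17   # cm

# Ratio of Coulomb to gravitational force at the same separation.
force_ratio = E_CHARGE**2 / (G_N * M_E * M_N)

# Ordinary Bohr radius and its gravitational analogue, as in (2.7.15).
a0_coulomb = HBAR**2 / (E_CHARGE**2 * M_E)
a0_gravity = HBAR**2 / (G_N * M_E**2 * M_N)

print(f"force ratio          ~ {force_ratio:.1e}")
print(f"Bohr radius          ~ {a0_coulomb:.2e} cm")
print(f"gravitational radius ~ {a0_gravity:.1e} cm"
      f" = {a0_gravity / LIGHT_YEAR:.1e} light years")
```

The gravitational radius is just the ordinary Bohr radius multiplied by the force ratio, since replacing $e^2$ by $G_N m_e m_n$ is the only change in (2.7.15).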

We now discuss a remarkable phenomenon known as gravity-induced quantum
interference. A nearly monoenergetic beam of particles (in practice, thermal
neutrons) is split into two parts and then brought together as shown in
Figure 2.9. In actual experiments the neutron beam is split and bent by silicon
crystals, but the details of this beautiful art of neutron interferometry do not
concern us here. Because the wave packet can be assumed to be much smaller
than the macroscopic dimension of the loop formed by the two alternative paths,
we can apply the concept of a classical trajectory. Let us first suppose that path
$A \to B \to D$ and path $A \to C \to D$ lie in a horizontal plane. Because the absolute
zero of the potential due to gravity is of no significance, we can set $V = 0$ for
any phenomenon that takes place in this plane; in other words, it is legitimate to
ignore gravity altogether. The situation is very different if the plane formed by the
two alternative paths is rotated around segment AC by $\delta$. This time the potential
at level BD is higher than that at level AC by $m_n g l_2 \sin\delta$, which means that the
state ket associated with path BD “rotates faster.” This leads to a gravity-induced
phase difference between the amplitudes for the two wave packets arriving at D.
Actually there is also a gravity-induced phase change associated with AB and also
with CD, but the effects cancel as we compare the two alternative paths. The net
result is that the wave packet arriving at D via path ABD suffers a phase change

$$\exp\left[\frac{-i m_n g l_2 (\sin\delta)\,T}{\hbar}\right] \tag{2.7.16}$$







relative to that of the wave packet arriving at D via path ACD, where $T$ is the time
spent for the wave packet to go from B to D (or from A to C) and $m_n$ is the neutron
mass. We can control this phase difference by rotating the plane of Figure 2.9; $\delta$
can change from 0 to $\pi/2$, or from 0 to $-\pi/2$. Expressing the time spent $T$, or
$l_1/v_{\rm wavepacket}$, in terms of $\bar{\lambda}$, the de Broglie wavelength of the neutron divided
by $2\pi$, we obtain the following expression for the phase difference:

$$\phi_{ABD} - \phi_{ACD} = -\frac{\left(m_n^2\, g\, l_1 l_2\, \bar{\lambda}\, \sin\delta\right)}{\hbar^2}. \tag{2.7.17}$$



In this manner we predict an observable interference effect that depends on angle
$\delta$, which is reminiscent of fringes in Michelson-type interferometers in optics.

An alternative, more wave-mechanical way to understand (2.7.17) follows.
Because we are concerned with a time-independent potential, the sum of the kinetic
energy and the potential energy is constant:

$$\frac{\mathbf{p}^2}{2m} + mgz = E. \tag{2.7.18}$$

The difference in height between level BD and level AC implies a slight difference
in $\mathbf{p}$, or $\bar{\lambda}$. As a result, there is an accumulation of phase differences due to the
$\bar{\lambda}$ difference. It is left as an exercise to show that this wave-mechanical approach
also leads to result (2.7.17).
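One way the exercise might go is sketched below. It assumes that the gravitational energy $m_n g l_2 \sin\delta$ is small compared with the kinetic energy, that $l_1$ is the length of segments AC and BD and $l_2$ the length of AB and CD, and that $\bar{\lambda} \equiv \hbar/p$ denotes the reduced de Broglie wavelength; the labels $p_{AC}$ and $p_{BD}$ for the momenta on the two levels are introduced here for convenience.

```latex
\begin{align*}
\frac{p_{AC}^2}{2m_n} &= \frac{p_{BD}^2}{2m_n} + m_n g l_2 \sin\delta
\;\Longrightarrow\;
p_{BD} - p_{AC} \simeq -\frac{m_n^2\, g\, l_2 \sin\delta}{p}, \\[4pt]
\phi_{ABD} - \phi_{ACD} &= \frac{(p_{BD} - p_{AC})\, l_1}{\hbar}
= -\frac{m_n^2\, g\, l_1 l_2 \sin\delta}{p\,\hbar}
= -\frac{m_n^2\, g\, l_1 l_2\, \bar{\lambda}\, \sin\delta}{\hbar^2}.
\end{align*}
```

The first line is energy conservation (2.7.18) expanded to first order in the small momentum change. The second uses the fact that the phases accumulated along AB and CD cancel, so only the segments BD and AC contribute, each accumulating a spatial phase $p\,l_1/\hbar$.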

What is interesting about expression (2.7.17) is that its magnitude is neither
too small nor too large; it is just right for this interesting effect to be detected
with thermal neutrons traveling through paths of “table-top” dimensions. For
$\lambda = 1.42$ Å (comparable to interatomic spacing in silicon) and $l_1 l_2 = 10$ cm², we
obtain 55.6 for $m_n^2 g l_1 l_2 \bar{\lambda}/\hbar^2$. As we rotate the loop plane gradually by 90°, we
predict the intensity in the interference region to exhibit a series of maxima and
minima; quantitatively we should see $55.6/2\pi \approx 9$ oscillations. It is extraordinary
that such an effect has indeed been observed experimentally; see Figure 2.10,
which is from a 1975 experiment of R. Colella, A. Overhauser, and S. A. Werner.
The phase shift due to gravity is seen to be verified to well within 1%.
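The size of the effect can be estimated with a few lines of arithmetic. This sketch assumes rounded CGS constants, $g = 980$ cm/s², and that the wavelength entering (2.7.17) is the quoted 1.42 Å divided by $2\pi$; the function name `cow_phase` is chosen here for illustration. With these inputs the magnitude at $\delta = 90°$ comes out near 56, in line with the 55.6 quoted above (the last digit depends on the constants used).

```python
import math

HBAR = 1.0546e-27      # erg s
M_N = 1.6749e-24       # g, neutron mass
G = 980.0              # cm/s^2 (assumed value of g)
WAVELENGTH = 1.42e-8   # cm, i.e. 1.42 angstrom
L1_L2 = 10.0           # cm^2, area enclosed by the two paths


def cow_phase(delta):
    """Phase difference (2.7.17), in radians, for tilt angle delta (radians)."""
    lam_bar = WAVELENGTH / (2.0 * math.pi)   # reduced de Broglie wavelength
    return -(M_N**2 * G * L1_L2 * lam_bar * math.sin(delta)) / HBAR**2


max_phase = abs(cow_phase(math.pi / 2.0))
print(f"phase at 90 degrees: {max_phase:.1f} rad")
print(f"oscillations over a 90 degree rotation: {max_phase / (2.0 * math.pi):.1f}")
```

Note the quadratic dependence on $m_n/\hbar$: a particle ten times lighter at the same wavelength would give a phase a hundred times smaller.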

We emphasize that this effect is purely quantum-mechanical because as $\hbar \to 0$,
the interference pattern gets washed out. The gravitational potential has been
shown to enter into the Schrödinger equation just as expected. This experiment
also shows that gravity is not purely geometric at the quantum level because the
effect depends on $(m/\hbar)^2$.*



Gauge Transformations in Electromagnetism 

Let us now turn to potentials that appear in electromagnetism. We consider an
electric and a magnetic field derivable from the time-independent scalar and vector
potential, $\phi(\mathbf{x})$ and $\mathbf{A}(\mathbf{x})$:

$$\mathbf{E} = -\boldsymbol{\nabla}\phi, \qquad \mathbf{B} = \boldsymbol{\nabla}\times\mathbf{A}. \tag{2.7.19}$$

*However, this does not imply that the equivalence principle is unimportant in understanding
an effect of this sort. If the gravitational mass ($m_{\rm grav}$) and inertial mass ($m_{\rm inert}$) were unequal,
$(m/\hbar)^2$ would have to be replaced by $m_{\rm grav} m_{\rm inert}/\hbar^2$. The fact that we could correctly predict the
interference pattern without making a distinction between $m_{\rm grav}$ and $m_{\rm inert}$ shows some support
for the equivalence principle at the quantum level.

FIGURE 2.10 Dependence of gravity-induced phase on angle of rotation $\delta$. From
R. Colella, A. W. Overhauser, and S. A. Werner, Phys. Rev. Lett. 34 (1975) 1472.

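The Hamiltonian for a particle of electric charge $e$ ($e < 0$ for the electron)
subjected to the electromagnetic field is taken from classical physics to be

$$H = \frac{1}{2m}\left(\mathbf{p} - \frac{e\mathbf{A}}{c}\right)^2 + e\phi. \tag{2.7.20}$$

In quantum mechanics $\phi$ and $\mathbf{A}$ are understood to be functions of the position
operator $\mathbf{x}$ of the charged particle. Because $\mathbf{p}$ and $\mathbf{A}$ do not commute, some care
is needed in interpreting (2.7.20). The safest procedure is to write

$$\left(\mathbf{p} - \frac{e\mathbf{A}}{c}\right)^2 \to \mathbf{p}^2 - \left(\frac{e}{c}\right)(\mathbf{p}\cdot\mathbf{A} + \mathbf{A}\cdot\mathbf{p}) + \left(\frac{e}{c}\right)^2\mathbf{A}^2. \tag{2.7.21}$$

In this form the Hamiltonian is obviously Hermitian.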

To study the dynamics of a charged particle subjected to $\phi$ and $\mathbf{A}$, let us first
proceed in the Heisenberg picture. We can evaluate the time derivative of $\mathbf{x}$ in a
straightforward manner as

$$\frac{dx_i}{dt} = \frac{[x_i, H]}{i\hbar} = \frac{(p_i - eA_i/c)}{m}, \tag{2.7.22}$$








which shows that the operator $\mathbf{p}$, defined in this book to be the generator of
translation, is not the same as $m\,d\mathbf{x}/dt$. Quite often $\mathbf{p}$ is called canonical momentum,
as distinguished from kinematical (or mechanical) momentum, denoted by $\boldsymbol{\Pi}$:

$$\boldsymbol{\Pi} \equiv m\frac{d\mathbf{x}}{dt} = \mathbf{p} - \frac{e\mathbf{A}}{c}. \tag{2.7.23}$$



Even though we have

$$[p_i, p_j] = 0 \tag{2.7.24}$$

for canonical momentum, the analogous commutator does not vanish for mechanical
momentum. Instead we have

$$[\Pi_i, \Pi_j] = \left(\frac{i\hbar e}{c}\right)\varepsilon_{ijk}B_k, \tag{2.7.25}$$



as the reader may easily verify. Rewriting the Hamiltonian as

$$H = \frac{\boldsymbol{\Pi}^2}{2m} + e\phi \tag{2.7.26}$$



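and using the fundamental commutation relation, we can derive the quantum-
mechanical version of the Lorentz force, namely,

$$m\frac{d^2\mathbf{x}}{dt^2} = \frac{d\boldsymbol{\Pi}}{dt} = e\left[\mathbf{E} + \frac{1}{2c}\left(\frac{d\mathbf{x}}{dt}\times\mathbf{B} - \mathbf{B}\times\frac{d\mathbf{x}}{dt}\right)\right]. \tag{2.7.27}$$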



This then is Ehrenfest’s theorem, written in the Heisenberg picture, for the
charged particle in the presence of $\mathbf{E}$ and $\mathbf{B}$.
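The commutator (2.7.25) used in this derivation can be checked in a few lines. The sketch below assumes only the definition (2.7.23), the relation $[p_i, F(\mathbf{x})] = -i\hbar\,\partial F/\partial x_i$, and $\mathbf{B} = \boldsymbol{\nabla}\times\mathbf{A}$ from (2.7.19); the terms $[p_i, p_j]$ and $[A_i, A_j]$ vanish and are dropped.

```latex
\begin{align*}
[\Pi_i, \Pi_j]
&= \left[p_i - \frac{eA_i}{c},\; p_j - \frac{eA_j}{c}\right]
 = -\frac{e}{c}\Big([p_i, A_j] + [A_i, p_j]\Big) \\
&= -\frac{e}{c}\left(-i\hbar\frac{\partial A_j}{\partial x_i}
   + i\hbar\frac{\partial A_i}{\partial x_j}\right)
 = \frac{i\hbar e}{c}\left(\frac{\partial A_j}{\partial x_i}
   - \frac{\partial A_i}{\partial x_j}\right)
 = \frac{i\hbar e}{c}\,\varepsilon_{ijk}B_k .
\end{align*}
```

The last step uses $\varepsilon_{ijk}B_k = \varepsilon_{ijk}\varepsilon_{klm}\,\partial_l A_m = \partial_i A_j - \partial_j A_i$. The symmetrized cross product in (2.7.27) arises because the components of $d\mathbf{x}/dt = \boldsymbol{\Pi}/m$ do not commute with $\mathbf{B}(\mathbf{x})$.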

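We now study Schrödinger’s wave equation with $\phi$ and $\mathbf{A}$. Our first task is to
sandwich $H$ between $\langle\mathbf{x}'|$ and $|\alpha,t_0;t\rangle$. The only term with which we have to be
careful is

$$\begin{aligned}
\langle\mathbf{x}'|\left[\mathbf{p} - \frac{e\mathbf{A}(\mathbf{x})}{c}\right]^2|\alpha,t_0;t\rangle
&= \left[-i\hbar\boldsymbol{\nabla}' - \frac{e\mathbf{A}(\mathbf{x}')}{c}\right]\langle\mathbf{x}'|\left[\mathbf{p} - \frac{e\mathbf{A}(\mathbf{x})}{c}\right]|\alpha,t_0;t\rangle \\
&= \left[-i\hbar\boldsymbol{\nabla}' - \frac{e\mathbf{A}(\mathbf{x}')}{c}\right]\cdot\left[-i\hbar\boldsymbol{\nabla}' - \frac{e\mathbf{A}(\mathbf{x}')}{c}\right]\langle\mathbf{x}'|\alpha,t_0;t\rangle.
\end{aligned} \tag{2.7.28}$$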



It is important to emphasize that the first $\boldsymbol{\nabla}'$ in the last line can differentiate both
$\langle\mathbf{x}'|\alpha,t_0;t\rangle$ and $\mathbf{A}(\mathbf{x}')$. Combining everything, we have



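$$\frac{1}{2m}\left[-i\hbar\boldsymbol{\nabla}' - \frac{e\mathbf{A}(\mathbf{x}')}{c}\right]\cdot\left[-i\hbar\boldsymbol{\nabla}' - \frac{e\mathbf{A}(\mathbf{x}')}{c}\right]\langle\mathbf{x}'|\alpha,t_0;t\rangle + e\phi(\mathbf{x}')\langle\mathbf{x}'|\alpha,t_0;t\rangle = i\hbar\frac{\partial}{\partial t}\langle\mathbf{x}'|\alpha,t_0;t\rangle. \tag{2.7.29}$$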
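To make explicit how the first gradient acts on both the wave function and the vector potential, the kinetic term of (2.7.29) can be expanded. In this sketch $\psi$ is shorthand, introduced here, for $\langle\mathbf{x}'|\alpha,t_0;t\rangle$, and the primes on $\boldsymbol{\nabla}$ and $\mathbf{x}$ are dropped for brevity.

```latex
\begin{align*}
\left[-i\hbar\boldsymbol{\nabla} - \frac{e\mathbf{A}}{c}\right]\cdot
\left[-i\hbar\boldsymbol{\nabla} - \frac{e\mathbf{A}}{c}\right]\psi
&= -\hbar^2\nabla^2\psi
 + \frac{i\hbar e}{c}\,\boldsymbol{\nabla}\cdot(\mathbf{A}\psi)
 + \frac{i\hbar e}{c}\,\mathbf{A}\cdot\boldsymbol{\nabla}\psi
 + \frac{e^2}{c^2}\,\mathbf{A}^2\psi \\
&= -\hbar^2\nabla^2\psi
 + \frac{i\hbar e}{c}\,(\boldsymbol{\nabla}\cdot\mathbf{A})\,\psi
 + \frac{2i\hbar e}{c}\,\mathbf{A}\cdot\boldsymbol{\nabla}\psi
 + \frac{e^2}{c^2}\,\mathbf{A}^2\psi .
\end{align*}
```

The term proportional to $\boldsymbol{\nabla}\cdot\mathbf{A}$ comes from the first gradient acting on $\mathbf{A}$; it is absent only when the divergence of $\mathbf{A}$ vanishes.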




